docs: add API reference for CollectorRegistry and custom collector classes by k1chik · Pull Request #1169 · prometheus/client_python

k1chik · 2026-04-24T15:14:09Z

Follows the same pattern as #1021 and #1162.

collector/custom.md: Collector protocol section (collect/describe), value vs labels mutual exclusivity note with correct per-type parameter names, full constructor and add_metric tables for GaugeMetricFamily, CounterMetricFamily, SummaryMetricFamily, HistogramMetricFamily, and InfoMetricFamily, plus a runnable real-world example.

collector/_index.md: constructor parameter tables for ProcessCollector, PlatformCollector, and GCCollector, with exported metrics listed for each.

registry/_index.md (new page): CollectorRegistry constructor and all public methods (register, unregister, collect, restricted_registry, get_sample_value, set_target_info, get_target_info), the global REGISTRY instance, and examples for isolated registry usage and registry=None.

All code examples verified by running them in Python.

cc @csmarchbanks

…asses Closes prometheus#1163 collector/custom.md: Collector protocol section (collect/describe), value vs labels mutual exclusivity note, full constructor and add_metric tables for GaugeMetricFamily, CounterMetricFamily, SummaryMetricFamily, HistogramMetricFamily, and InfoMetricFamily, plus a runnable real-world example. collector/_index.md: constructor parameter tables for ProcessCollector, PlatformCollector, and GCCollector, with exported metrics listed for each. registry/_index.md (new): CollectorRegistry constructor and all public methods (register, unregister, collect, restricted_registry, get_sample_value, set_target_info, get_target_info), the global REGISTRY instance, and examples for isolated registry usage and registry=None. All code examples verified by running them in Python. Signed-off-by: k1chik <107162115+k1chik@users.noreply.github.com>

- GCCollector does not support registry=None on CPython; remove that claim - Remove unused CONTENT_TYPE_LATEST import from registry example - Fix 'value vs labels' section to correctly describe Summary (count_value/sum_value) and Histogram (buckets) Signed-off-by: k1chik <107162115+k1chik@users.noreply.github.com>

csmarchbanks

Thanks!

calestyo · 2026-05-03T23:08:48Z

Hey.

Thanks for improving on the documentation... that really helps a lot, compared to few years ago when I had to basically fiddle out most of these things from the code :-)

But one thing:

Why the yields in the collect()? I don’t quite get that.

Shouldn't collect() return sequences of *MetricFamily objects? You now return only a single one, and there even alternating ones like in:
a708480#diff-eadfec94068584be6fec5cce354e065ead34a754bc84ead564adf9e8c497ab10R237-R243

Or maybe you mean this as two independent examples?
What makes it confusing though is (which is however not from your commit):

client_python/docs/content/collector/custom.md

Lines 16 to 21 in 482656c

    
           def collect(self): 
        
               yield GaugeMetricFamily('my_gauge', 'Help text', value=7) 
        
               c = CounterMetricFamily('my_counter_total', 'Help text', labels=['foo']) 
        
               c.add_metric(['bar'], 1.7) 
        
               c.add_metric(['baz'], 3.8) 
        
               yield c

There collect() alternates between returning (well yielding) a Gauge and Counter (also not as sequences)?

I generally don't quite understand why one would yield here at all.

Isn't the reason for custom collectors to start every time fresh (and get rid of all stale metrics, labels, etc.)?
With yield wouldn't you keep all that?

Thanks,
Chris.

calestyo · 2026-05-03T23:32:37Z

Shouldn't do stuff late night... yield of course makes it an iterator o.O

Still I wonder, is there any advantage over using that in the current form rather than return?

I mean is Python' GC smart enough that it can already clean the objects that have already been yielded and thus give an advantage by reducing e.g. memory use for very large metric sets?

In:

        yield GaugeMetricFamily('my_gauge', 'Help text', value=7)
        c = CounterMetricFamily('my_counter_total', 'Help text', labels=['foo'])
        c.add_metric(['bar'], 1.7)
        c.add_metric(['baz'], 3.8)
        yield c

I could imagine that it may release the GaugeMetricFamily object early... but c? It still has a reference?

In any case it would be nice to have in the docs why generators are used (if it's memory advantages) and how to use them so that this actually works out. :-)

k1chik · 2026-05-04T14:43:57Z

Hi @calestyo, thanks for the read! glad to see you've found the docs useful.

And yes, my bad, the code examples are missing the def collect(self): wrapper which makes it unclear what context they're meant to be in, i'll fix that asap.

Your GC read is right too, yield GaugeMetricFamily(...) inline like that can be reclaimed as soon as the caller moves on, but c keeps a reference in the generator frame until it's done. The main reason for yield here isn't memory though, it's just that the registry iterates over whatever collect() returns and a generator is cleaner than building a list. For most collectors you'd never notice the difference anyway.

I'll open a follow-up PR with both fixes and add a note explaining why generators are used. Thanks again for your feedback.

Shouldn't do stuff late night... - :-)

* docs: clarify collect() generator usage and API Reference snippet context Add a note to the collect() protocol section explaining that yield is idiomatic (generator iterates lazily, no state between scrapes) and a preamble to the API Reference section clarifying that code snippets belong inside a collect() method. Follows up on review feedback in #1169. Signed-off-by: k1chik <kkukdia@gmail.com> * docs: split InfoMetricFamily example into two separate blocks The single block with two yield statements looked like one collect() yielding both patterns. Split into labelled prose + code pairs to make clear they are alternatives, not sequential yields. Signed-off-by: k1chik <kkukdia@gmail.com> --------- Signed-off-by: k1chik <kkukdia@gmail.com>

calestyo · 2026-05-08T00:06:53Z

Hmm, but if you say memory isn't the main reason, the real one should perhaps be given?

And if memory is at least part of the reason, then it might have made sense to explicitly mention that and also explain to people that ideally they should re-use the same identifier so that the GC can kick in.

Even perhaps if they have a case where it's reasonable to first collect a few metrics and only yield all of them afterwards, like in:

for loopOver in SomeJSONelements:
    metric[0].add_metric()
    metric[1].add_metric()
for m in metric:
    yield m

#... next metric again uses `metric` var

I.e. telling that it's still good to use only one identifier metric here, so that the GC can work

k1chik · 2026-05-08T01:30:59Z

Hi Chris, thanks for the response. Good point on both counts! I'll add a note that memory is at least part of the reason for using yield, and mention that reusing the same identifier helps the GC kick in sooner. PR coming shortly :)

@calestyo

Addresses follow-up feedback on prometheus#1169 from @calestyo: the existing yield explanation mentioned lazy iteration but omitted memory as a secondary benefit and gave no guidance on variable reuse for GC efficiency. Added a paragraph explaining that yielding inline lets Python reclaim the object as soon as the registry advances, while a named variable stays alive until rebound. Also added a short loop example showing how reusing the same variable name across iterations gives the GC the opportunity to reclaim each object before the next is allocated. Signed-off-by: k1chik <107162115+k1chik@users.noreply.github.com>

k1chik added 2 commits April 24, 2026 11:04

csmarchbanks approved these changes Apr 24, 2026

View reviewed changes

csmarchbanks merged commit 482656c into prometheus:master Apr 24, 2026
12 checks passed

k1chik mentioned this pull request May 4, 2026

docs: follow-up fixes for collect() generator examples (#1169) #1172

Merged

k1chik mentioned this pull request May 8, 2026

docs: add memory benefit and variable-reuse note to collect() section #1173

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: add API reference for CollectorRegistry and custom collector classes#1169

docs: add API reference for CollectorRegistry and custom collector classes#1169
csmarchbanks merged 2 commits intoprometheus:masterfrom
k1chik:fix/1163-api-reference-registry-collectors

k1chik commented Apr 24, 2026

Uh oh!

csmarchbanks left a comment

Uh oh!

Uh oh!

calestyo commented May 3, 2026

Uh oh!

calestyo commented May 3, 2026

Uh oh!

k1chik commented May 4, 2026

Uh oh!

calestyo commented May 8, 2026

Uh oh!

k1chik commented May 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

k1chik commented Apr 24, 2026

Uh oh!

csmarchbanks left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

calestyo commented May 3, 2026

Uh oh!

calestyo commented May 3, 2026

Uh oh!

k1chik commented May 4, 2026

Uh oh!

calestyo commented May 8, 2026

Uh oh!

k1chik commented May 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants