Data Introspection¶
Data introspectors observe intermediate model responses,
and process data in batches when calling
.introspect().
Dataset Report¶
The DatasetReport bundles Familiarity, Duplicates and Dimension Reduction introspectors (below) in an interactive interface with various visualization options.
Familiarity¶
Familiarity quantifies how familiar a data point is to a specific dataset or subset, by fitting a probability distribution to the activations of the specified layer(s), and then evaluating the probability of any data sample according to the distribution.
Duplicates¶
Find near-duplicate data. Uses an approximate nearest neighbor to build a distance matrix for all samples and clusters the closest samples.
Dimension Reduction¶
Projects high dimensional activation data to a lower dimension, usually for consumption by a different introspector or for 2D or 3D visualization.