INDEX
Explanations
words related to a single entity or object
phrases expressing singularity or uniqueness
Negative Logits
Token         Logit
yrinth        -0.86
thumbnails    -0.79
archives      -0.75
idon          -0.70
atu           -0.69
çī            -0.68
imir          -0.67
apesh         -0.67
types         -0.66
chi           -0.65
Positive Logits
Token         Logit
ounce         1.27
dime          1.23
penny         1.05
inch          0.98
THING         0.96
trace         0.83
shred         0.83
slightest     0.80
nickel        0.79
mention       0.79
Activations Density: 0.176%