INDEX
Explanations
phrases related to small groups or quantities
repeated mentions of the word "small."
New Auto-Interp
Negative Logits
emis
-0.70
reon
-0.69
ilee
-0.65
ources
-0.64
Builder
-0.62
rob
-0.62
YR
-0.61
emy
-0.61
alin
-0.59
Indra
-0.59
POSITIVE LOGITS
pox
1.38
intestine
1.03
handful
0.97
fry
0.89
folk
0.87
amount
0.87
ish
0.87
consolation
0.86
sized
0.86
percentage
0.85
Activations Density 0.050%