INDEX
Explanations
proper nouns related to locations or entities with numbers or counting elements in them
mentions of numbers and numerical references
New Auto-Interp
Negative Logits
includ
-0.75
inating
-0.69
yrim
-0.66
ushes
-0.65
ained
-0.64
vt
-0.64
USD
-0.64
activity
-0.64
adel
-0.64
clock
-0.63
POSITIVE LOGITS
teen
1.05
Hundred
1.01
Mile
0.96
Eye
0.93
Thousand
0.92
Lives
0.92
Isles
0.89
een
0.87
Vision
0.85
Eyes
0.84
Activations Density 0.051%