INDEX
Explanations
the word "object" and related forms
occurrences of the word "object" in various contexts
Negative Logits
Token       Logit
lla         -0.72
NPR         -0.71
ornia       -0.70
ricia       -0.70
artney      -0.67
elsius      -0.67
millenn     -0.67
Leone       -0.66
mingham     -0.66
corn        -0.64
Positive Logits
Token       Logit
ivity       1.15
ively       1.08
ivist       0.99
imus        0.94
ivism       0.93
ifying      0.92
ifies       0.92
ified       0.91
ion         0.90
ification   0.89
Activations Density: 0.010%