Explanations
proper nouns related to current events and organizations
proper nouns, particularly names of people, places, and cultures
Negative Logits
Token         Logit
jri           -0.70
oppable       -0.69
prest         -0.68
Niet          -0.58
sit           -0.58
enegger       -0.56
/,            -0.55
akespe        -0.54
ãĥ¼ãĥĨ        -0.54
ilaterally    -0.54
Positive Logits
Token         Logit
reacts        0.65
celebrates    0.58
)))           0.57
][            0.57
::            0.55
approves      0.55
Moves         0.54
Motors        0.54
Brewing       0.53
ãĥ            0.53
Activations Density: 0.324%