INDEX
Explanations
proper nouns representing specific entities, most likely organizations or events
Negative Logits
witch      -0.79
acid       -0.77
abuse      -0.75
ghazi      -0.74
âĹ¼        -0.73
Invalid    -0.72
iably      -0.71
20439      -0.71
anyway     -0.70
ratom      -0.70
Positive Logits
goal            1.32
aim             1.23
announcement    1.09
centerpiece     1.07
inaugural       1.03
project         1.02
objective       1.01
initiative      1.01
purpose         1.00
trio            1.00
Activations Density 0.280%