INDEX
Explanations
terms related to political figures pursuing information
verbs that indicate ongoing actions
New Auto-Interp
Negative Logits
advertisement
-0.75
sequence
-0.72
Spoiler
-0.68
glitch
-0.65
destro
-0.65
Ö
-0.64
dash
-0.63
ammy
-0.63
wash
-0.62
pict
-0.62
POSITIVE LOGITS
luaj
0.75
................................................................
0.72
ingen
0.63
diplomacy
0.63
respecting
0.61
Cuba
0.61
Finland
0.59
ikk
0.59
Ecuador
0.59
Emb
0.59
Activations Density 0.000%