INDEX
Explanations
people's names, potentially those in politics
repeated names and surnames, particularly those starting with "Mc" or "O'"
New Auto-Interp
Negative Logits
Sakuya
-0.67
Ariel
-0.64
Golem
-0.62
exclus
-0.58
================================================================
-0.58
Torah
-0.57
meditation
-0.57
GamerGate
-0.56
Palo
-0.56
ãĥ´
-0.56
POSITIVE LOGITS
endon
1.11
rick
1.01
ney
0.98
ulty
0.97
igham
0.91
iffe
0.91
igan
0.90
agall
0.88
enzie
0.88
ridge
0.88
Activations Density 0.154%