INDEX
Explanations
phrases related to political discussions and accusations
New Auto-Interp
Negative Logits
Phi
-0.84
ulhu
-0.81
Compass
-0.77
Ń·
-0.73
Slide
-0.73
Confederation
-0.72
TAMADRA
-0.72
Noir
-0.72
Pok
-0.71
Bravo
-0.70
POSITIVE LOGITS
enough
1.28
sized
1.23
connected
1.21
defined
1.17
equipped
1.17
trained
1.16
known
1.15
appointed
1.13
established
1.13
placed
1.12
Activations Density 9.510%