INDEX
Explanations
mentions of behaviors or situations associated with extreme or irrational actions
New Auto-Interp
Negative Logits
Lot
-0.64
LCS
-0.62
silenced
-0.60
LECT
-0.60
âĢ¢âĢ¢
-0.58
NRS
-0.58
Presbyterian
-0.57
ĵĺ
-0.57
PB
-0.57
governed
-0.57
POSITIVE LOGITS
cheon
1.56
atics
1.46
atic
1.46
acy
1.34
atical
1.27
etic
1.15
acies
1.15
aris
1.14
acia
1.10
amic
1.10
Activations Density 0.042%