INDEX
Explanations
words related to conflicts or troubles
New Auto-Interp
Negative Logits
pta
-0.92
ortmund
-0.87
olphins
-0.86
atari
-0.86
regation
-0.85
iets
-0.84
OND
-0.83
ULE
-0.82
kamp
-0.80
udence
-0.79
POSITIVE LOGITS
downright
1.30
prone
1.21
lacking
1.16
incapable
1.14
indistinguishable
1.12
riddled
1.10
aest
1.09
resistant
1.07
devoid
1.06
capable
1.05
Activations Density 0.324%