INDEX
Explanations
mentions of specific names related to legal disputes or personal testimonies
New Auto-Interp
Negative Logits
ERO
-0.83
Malfoy
-0.76
ors
-0.73
orsche
-0.70
Eat
-0.69
Decay
-0.68
Strauss
-0.67
wn
-0.67
arial
-0.66
fierce
-0.65
POSITIVE LOGITS
gency
1.26
maid
1.19
chant
1.13
rill
1.05
lins
0.98
Rouge
0.95
cia
0.94
ikan
0.94
gence
0.92
cedes
0.91
Activations Density 7.491%