INDEX
Explanations
mentions of staff and organizational roles within the text
New Auto-Interp
Negative Logits
illard
-0.15
ná
-0.14
stunt
-0.14
Podle
-0.14
promise
-0.13
Promise
-0.13
596
-0.13
averages
-0.13
dead
-0.13
782
-0.13
POSITIVE LOGITS
ouser
0.16
vyk
0.16
éĤ
0.15
åĤ¬
0.15
ÑĤоÑİ
0.14
aya
0.14
#ad
0.14
amo
0.14
akan
0.14
ç»ĻæĪij
0.13
Activations Density 0.087%