INDEX
Explanations
references to prominent historical figures and thinkers in social and political theory
authors and their statements
New Auto-Interp
Negative Logits
SharedCtor
-0.69
мәкал
-0.67
NSCoder
-0.63
Personendaten
-0.61
rungsseite
-0.60
Italijanski
-0.60
ftagPool
-0.59
SuppressLint
-0.59
Personensuche
-0.58
хьтан
-0.57
POSITIVE LOGITS
Citiți
0.36
lioz
0.35
vezes
0.33
between
0.33
pozor
0.33
zwischen
0.32
passiert
0.32
lgari
0.31
sadly
0.31
tussen
0.31
Activations Density 0.116%