INDEX
Explanations
instances of argumentation and discussion related to social issues
Negative Logits
GEBURTSDATUM     -1.07
Portail          -0.99
OGND             -0.93
дописавши        -0.86
فريبيس           -0.83
Chham            -0.82
IntoConstraints  -0.80
TagMode          -0.78
autorytatywna    -0.76
хьтан            -0.74
Positive Logits
ifers        0.46
Ideally      0.46
correto      0.45
religieuses  0.45
So           0.45
itere        0.43
consistent   0.43
an           0.43
القرن        0.42
ваги         0.42
Activations Density: 0.176%