INDEX
Explanations
concepts related to complex social dynamics and interactions
Negative Logits
so              -0.43
even            -0.42
and             -0.41
I               -0.40
()):            -0.40
()              -0.39
Sen             -0.38
fhir            -0.38
Fehl            -0.37
Kol             -0.37
Positive Logits
entanto         1.02
ftagPool        0.97
however         0.93
όμως            0.92
però            0.90
however         0.89
autorytatywna   0.88
########.       0.86
Obrázky         0.83
فريبيس          0.83
Activations Density 0.976%