INDEX
Explanations
questions and serious discussions revolving around personal experiences and societal issues
Negative Logits
Token         Logit
[...          -0.20
[…]↵          -0.20
[...]         -0.19
[…]...↵       -0.19
[…]           -0.18
enthusi       -0.17
[...]↵↵       -0.17
[…]↵↵         -0.16
...↵          -0.15
actionDate    -0.15
Positive Logits
Token         Logit
udu           0.12
sut           0.12
hangi         0.12
uzzy          0.11
achte         0.11
avra          0.11
celik         0.11
vlas          0.11
separator     0.10
sperma        0.10
Activations Density: 1.211%