INDEX
Explanations
intensely exaggerated expressions and references to "pods"
New Auto-Interp
Negative Logits
akcji
-0.62
الإنجليزية
-0.62
-0.61
découver
-0.59
disambiguazione
-0.57
مشين
-0.57
zyż
-0.57
Disqus
-0.57
roux
-0.57
Keim
-0.57
POSITIVE LOGITS
damn
0.76
////////////////
0.72
damned
0.67
Federal
0.66
под
0.65
darn
0.65
pod
0.63
damn
0.60
Под
0.55
Pod
0.54
Activations Density 0.073%