INDEX
Explanations
words that imply strangeness or peculiarity
New Auto-Interp
Negative Logits
yoksa
-0.66
genomen
-0.63
Geplaatst
-0.62
وردار
-0.61
Datuak
-0.61
стоин
-0.60
respectivas
-0.58
ोंने
-0.57
iesis
-0.56
mortar
-0.55
POSITIVE LOGITS
strange
3.77
weird
3.39
strange
3.33
Strange
3.01
weird
2.94
Strange
2.92
strangest
2.87
Weird
2.84
Weird
2.81
wierd
2.80
Activations Density 0.109%