INDEX
Explanations
words related to geographical locations or political events
occurrences of specific phonetic sounds or letter patterns
Negative Logits
BILITY        -0.61
istg          -0.60
76561         -0.58
Scion         -0.57
deterrence    -0.57
NCT           -0.56
PID           -0.56
reins         -0.54
Wanted        -0.54
Berks         -0.54
Positive Logits
arella        0.86
atchewan      0.73
estic         0.72
hin           0.70
iris          0.67
adan          0.67
bia           0.67
raised        0.67
oran          0.67
oya           0.66
Activations Density: 0.097%