Explanations
words related to sharp objects or actions
occurrences of the distinct linguistic feature "th."
Negative Logits
Token          Logit
ãĤŃ            -0.84
76561          -0.79
ITED           -0.77
Tokens         -0.71
assetsadobe    -0.68
VICE           -0.67
HAEL           -0.67
FAULT          -0.67
ãĤ¹ãĥĪ         -0.65
Katz           -0.65
Positive Logits
Token          Logit
ieving         1.11
istle          1.04
umb            1.01
umping         1.00
urst           0.97
orns           0.97
ttp            0.97
umbnails       0.95
ouse           0.94
warts          0.93
Activations Density: 0.007%