INDEX
Explanations
words related to personal names or proper nouns, especially those with strong emotional connotations
references to social and cultural themes
New Auto-Interp
Negative Logits
Pwr
-0.63
Moff
-0.59
AE
-0.55
Osw
-0.53
Gadget
-0.50
Colony
-0.47
Yorkshire
-0.46
Formula
-0.46
Gat
-0.45
Zur
-0.44
POSITIVE LOGITS
ioned
0.71
arant
0.68
odox
0.65
alin
0.62
ript
0.59
omas
0.59
omal
0.59
sing
0.59
ographed
0.58
roid
0.58
Activations Density 2.027%