INDEX
Explanations
names and abbreviations, as well as words related to physical states of being or actions like "sleep," "suicide," and "aggressiveness."
Negative Logits
Ĥª            -0.66
»Ĵ            -0.63
tremend       -0.62
ADRA          -0.60
misdem        -0.58
bluff         -0.57
Skydragon     -0.56
clicks        -0.56
uninsured     -0.56
premiums      -0.56
Positive Logits
phant         0.81
vre           0.81
ghan          0.75
anasia        0.71
ritch         0.69
rahim         0.69
rencies       0.68
ionage        0.68
naissance     0.67
mot           0.67
Activations Density 4.632%