INDEX
Explanations
adjectives indicating strong beliefs or loyalty
words that indicate strong personal beliefs or unwavering support
New Auto-Interp
Negative Logits
ammy
-1.05
hops
-0.91
nesota
-0.83
ombies
-0.74
ammers
-0.72
uden
-0.72
NetMessage
-0.72
ovember
-0.70
APH
-0.70
anders
-0.68
POSITIVE LOGITS
ly
1.10
ness
0.85
wart
0.83
supporter
0.76
staunch
0.76
nesses
0.74
ELY
0.71
ity
0.70
exponent
0.70
kowski
0.69
Activations Density 0.027%