INDEX
Explanations
words related to reactions or responses from people or groups
terms related to reactions and responses
Negative Logits
skip       -0.66
Zot        -0.65
bilt       -0.63
atan       -0.63
cutting    -0.63
ramer      -0.61
cut        -0.61
BALL       -0.60
sonian     -0.59
rey        -0.59
Positive Logits
aries      1.13
ivated     1.04
thereto    0.98
ivating    0.91
ivation    0.85
ariat      0.83
iveness    0.83
naires     0.79
aire       0.79
gif        0.76
Activations Density: 0.074%