INDEX
Explanations
phrases related to reactions or responses to various situations or events
Negative Logits
Token          Logit
holder         -0.79
iciency        -0.76
hold           -0.74
locked         -0.71
fficiency      -0.66
Sinai          -0.66
inav           -0.64
Sustainable    -0.63
scribe         -0.63
holding        -0.62
Positive Logits
Token          Logit
ivated         1.33
ivation        1.20
ivating        1.17
aries          1.13
reaction       1.12
reactions      1.10
Reaction       0.99
iv             0.87
negatively     0.86
ivity          0.85
Activations Density: 8.251%