INDEX
Explanations
phrases that express a strong opinion or make a judgment
phrases that denote a condition or situation described with the word "that."
Negative Logits
Voting        -0.70
anton         -0.69
SD            -0.66
Aden          -0.65
known         -0.64
Modified      -0.63
episode       -0.62
ESE           -0.61
classified    -0.61
Previous      -0.61
Positive Logits
deserves      1.16
threatens     1.11
justifies     1.09
undermines    1.07
attracts      1.07
overwhel      1.07
eats          1.06
refuses       1.05
lacks         1.04
awaits        1.03
Activations Density: 0.269%