INDEX
Explanations
phrases related to opinions or judgments
expressions of personal opinions
New Auto-Interp
Negative Logits
zz
-0.78
escape
-0.74
resp
-0.66
++;
-0.66
Stretch
-0.62
zees
-0.62
ackle
-0.61
jar
-0.61
Thief
-0.60
rises
-0.60
POSITIVE LOGITS
opinion
3.89
Opinion
2.63
opin
2.55
opinions
2.53
inion
1.68
sentiment
1.46
judgement
1.43
judgment
1.37
impression
1.31
views
1.29
Activations Density 0.014%