INDEX
Explanations
phrases indicating agreement or disagreement with a particular plan or idea
expressions of agreement or disagreement with plans or opinions
New Auto-Interp
Negative Logits
ëĭ
-0.68
lookup
-0.62
datas
-0.61
egu
-0.59
Fortune
-0.59
Ops
-0.58
Blizz
-0.58
Moz
-0.58
Lever
-0.57
Worlds
-0.57
POSITIVE LOGITS
galitarian
0.88
76561
0.87
disagrees
0.82
endorse
0.82
endorsing
0.78
dismissing
0.77
recommending
0.75
disagree
0.74
vehemently
0.74
Rate
0.74
Activations Density 0.289%