INDEX
Explanations
positive and negative qualities or characteristics
phrases highlighting positive aspects or "good things" about a topic
New Auto-Interp
Negative Logits
chair
-0.82
urated
-0.79
igate
-0.73
mens
-0.73
ALK
-0.72
utm
-0.71
rained
-0.70
20439
-0.69
eyes
-0.69
alf
-0.69
POSITIVE LOGITS
kicker
0.82
bonus
0.73
Bonus
0.70
Solitaire
0.69
downside
0.69
happens
0.64
Bonus
0.64
:]
0.64
surprises
0.63
Pinball
0.62
Activations Density 0.197%