INDEX
Explanations
instances where people are expressing their opinions or making evaluations
words related to evaluating or considering options
Negative Logits
Kart        -0.72
algia       -0.72
ãĥĨãĤ£      -0.71
nered       -0.70
etheless    -0.69
later       -0.67
anie        -0.67
chief       -0.66
ITIES       -0.65
Parables    -0.65
Positive Logits
weigh       1.16
weighing    0.97
weighed     0.93
weights     0.88
heaviest    0.84
pros        0.82
weighs      0.80
heavily     0.80
heavier     0.78
iless       0.77
Activations Density: 0.029%