INDEX
Explanations
statements expressing belief, opinion, or personal perspective
Negative Logits
artney        -0.72
elle          -0.67
ption         -0.64
fare          -0.62
purportedly   -0.61
bies          -0.59
eller         -0.58
ueless        -0.57
lopp          -0.57
iae           -0.57
Positive Logits
poke           0.80
strongly       0.71
myself         0.68
ħ              0.68
passionately   0.65
ĸ              0.63
fortunate      0.63
personally     0.60
§              0.60
ļé             0.59
Activations Density: 11.750%