INDEX
Explanations
confirmations or agreements in sentences
affirmations or confirmations in the text
New Auto-Interp
Negative Logits
tnc
-0.80
Gleaming
-0.76
bage
-0.74
20439
-0.72
RAW
-0.71
rament
-0.70
actionDate
-0.66
Redd
-0.66
vati
-0.64
ocene
-0.64
POSITIVE LOGITS
terday
1.53
sir
0.78
indeed
0.75
evil
0.71
hua
0.67
kidding
0.67
eed
0.66
yes
0.63
Pryor
0.63
Sir
0.63
Activations Density 0.025%