INDEX
Explanations
phrases and sentences discussing opinions, beliefs, and doubts
expressions of doubt or skepticism about opinions or statements
Negative Logits
itton          -0.68
pez            -0.67
asia           -0.66
ulner          -0.65
ERT            -0.63
largeDownload  -0.57
beware         -0.55
whenever       -0.55
ahi            -0.55
hap            -0.55
Positive Logits
anymore        1.55
nor            1.16
necessarily    1.12
ever           1.11
any            1.09
EVER           1.05
slightest      1.03
anything       1.00
bothered       0.98
anybody        0.93
Activations Density 0.222%
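
The density figure is usually read as the share of sampled token positions on which the feature fires. The source does not say how 0.222% was computed, so the following is only a minimal sketch under that assumption; the function name `activation_density`, the zero threshold, and the toy data are all hypothetical.

```python
import numpy as np

def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the feature is
    active. The threshold of 0.0 is an illustrative choice, not from the source.
    """
    acts = np.asarray(activations, dtype=float)
    if acts.size == 0:
        return 0.0  # no tokens sampled, so report zero rather than divide by zero
    return float(np.count_nonzero(acts > threshold)) / acts.size

# Hypothetical toy data: 1000 token positions, feature active on 2 of them.
toy = np.zeros(1000)
toy[[10, 500]] = [0.7, 1.3]

density = activation_density(toy)
print(f"{density:.3%}")
```

A low density such as the one reported here would mean, under this reading, that the feature is active on only a small fraction of tokens.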