INDEX
Explanations
questions starting with "What do you think?"
questions that prompt discussion or opinion seeking
New Auto-Interp
Negative Logits
WAYS
-0.87
ahime
-0.73
boats
-0.70
Interstitial
-0.69
boards
-0.69
mobi
-0.68
thur
-0.67
legram
-0.65
acerb
-0.65
fox
-0.65
POSITIVE LOGITS
happen
0.80
?]
0.77
omsday
0.72
actic
0.69
iotic
0.66
ederal
0.66
iosyncr
0.65
notation
0.64
mean
0.64
?),
0.63
Activations Density 0.051%