Explanations
sections of text indicating the speaker is reluctant to provide further details
phrases indicating reluctance to discuss certain topics
Negative Logits
Token       Logit
arov        -0.75
ãĥīãĥ©      -0.73
æ©          -0.70
turnover    -0.65
ereo        -0.59
roots       -0.59
ushi        -0.57
ounter      -0.56
nell        -0.55
kefeller    -0.55
Positive Logits
Token       Logit
haha        1.13
lest        1.08
ðŁĺ         1.03
ðŁĻĤ        1.02
here        1.01
;)          0.98
rant        0.97
:)          0.97
:(          0.96
suffice     0.93
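Several tokens in the logit lists (for example ãĥīãĥ© and ðŁĻĤ) look like raw byte-level BPE vocabulary strings rather than readable text. The sketch below shows one way to turn such strings back into text. It assumes the tokens come from a GPT-2-style byte-level tokenizer, which the source does not state; the helper names `byte_to_char_table` and `decode_token` are hypothetical.

```python
def byte_to_char_table():
    """Rebuild the byte -> printable-character table used by GPT-2-style
    byte-level BPE (assumption: the dashboard's tokenizer follows it)."""
    # Bytes that are already printable map to themselves.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    offset = 0
    # Every remaining byte is shifted to a code point starting at 256.
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + offset)
            offset += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


# Inverse table: displayed character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in byte_to_char_table().items()}


def decode_token(token):
    """Map a vocabulary string back to UTF-8 text.

    Incomplete byte sequences (a token holding only part of a multi-byte
    character) come back as the Unicode replacement character.
    """
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãĥīãĥ©", "ðŁĻĤ", "haha"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption, ãĥīãĥ© decodes to the katakana ドラ and ðŁĻĤ to the emoji 🙂, while ðŁĺ and æ© are only prefixes of multi-byte characters and cannot be decoded on their own.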
Activations Density 0.312%
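The density figure is most naturally read as the fraction of sampled tokens on which the feature fires. The source does not define it, so the sketch below is an assumption: it counts activations strictly above a threshold (zero by default), and the name `activation_density` and the sample data are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose feature activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens with a nonzero
    activation; the dashboard itself does not spell this out.
    """
    total = len(activations)
    if total == 0:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / total


if __name__ == "__main__":
    # Hypothetical sample: 1000 tokens, 3 of which activate the feature.
    sample = [0.0] * 997 + [1.2, 0.4, 2.0]
    print(f"{activation_density(sample):.3%}")
```

A density near 0.3% would then mean the feature is active on roughly 3 of every 1,000 tokens, which fits a sparse feature tied to a narrow conversational pattern.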