INDEX
Explanations
string patterns indicating quotations or spoken dialogue
words or phrases related to feelings of insecurity or uncertainty
New Auto-Interp
Negative Logits
FANT
-0.74
PID
-0.65
puff
-0.65
©¶æ
-0.63
pid
-0.63
souven
-0.62
ãĥ¼ãĥĨãĤ£
-0.61
iage
-0.61
è¦ļéĨĴ
-0.61
gimmick
-0.58
POSITIVE LOGITS
ï¸ı
1.01
cause
0.93
said
0.81
_>
0.79
Tx
0.78
alas
0.74
Pg
0.73
whe
0.72
fortunately
0.71
âĶĢâĶĢâĶĢâĶĢ
0.71
Activations Density 0.186%