INDEX
Explanations
phrases signaling concern or emphasis
phrases related to concern or awareness
New Auto-Interp
Negative Logits
obal
-0.71
rang
-0.69
omas
-0.67
oru
-0.65
VIEW
-0.60
inous
-0.56
Mahjong
-0.55
cozy
-0.55
Together
-0.55
iltr
-0.54
POSITIVE LOGITS
IAS
0.67
ieth
0.64
UME
0.63
.,
0.59
orsche
0.58
enth
0.58
umes
0.56
Pole
0.56
enery
0.55
,
0.55
Activations Density 0.054%