INDEX
Explanations
phrases expressing the desire to seek help or assistance
New Auto-Interp
Negative Logits
wik
-0.69
suggestion
-0.62
Globe
-0.60
imon
-0.59
Flat
-0.59
indication
-0.57
Sheen
-0.57
Fraz
-0.56
Meaning
-0.56
Apostle
-0.56
POSITIVE LOGITS
've
1.01
'll
0.94
're
0.92
could
0.91
'd
0.91
'm
0.89
owe
0.88
should
0.87
SHOULD
0.86
selves
0.83
Activations Density 1.116%