INDEX
Explanations
phrases related to actions or statements that stand out distinctly
the term 'Out' and related phrases indicating a release or exit
Negative Logits
precedent     -0.62
stem          -0.60
religiously   -0.59
du            -0.59
canon         -0.59
fir           -0.58
torn          -0.58
consult       -0.58
glac          -0.57
dear          -0.57
Positive Logits
Out           3.45
OUT           2.00
Out           1.95
out           1.91
outs          1.67
OUT           1.63
Outside       1.48
Off           1.41
Over          1.29
Down          1.27
Activations Density: 0.010%