INDEX
Explanations
phrases or punctuation indicating emphasis or significance
instances of dashes or interruptions in text
Negative Logits
protective    -0.72
othes         -0.65
packing       -0.65
ovi           -0.64
Piece         -0.64
oven          -0.63
ograph        -0.63
basil         -0.62
grave         -0.62
cream         -0.61
Positive Logits
lance         0.94
fuck          0.91
jobs          0.89
micro         0.83
something     0.81
sil           0.81
release       0.79
sat           0.79
like          0.79
assisted      0.78
Activations Density 0.014%