INDEX
Explanations
phrases containing punctuation marks such as periods, colons, and quotation marks
periods or punctuations in the text
New Auto-Interp
Negative Logits
icz
-0.80
crab
-0.74
fermented
-0.74
crabs
-0.73
pse
-0.73
charact
-0.70
oun
-0.69
neighb
-0.69
exploited
-0.67
confir
-0.65
POSITIVE LOGITS
Lots
1.06
Includes
1.05
txt
1.05
Especially
1.03
Retrieved
1.03
Contains
1.03
Possibly
1.02
wav
1.01
Including
0.99
Comes
0.98
Activations Density 0.339%