INDEX
Explanations
phrases related to contrasting situations
instances of punctuation or ellipses that indicate pauses or omissions in text
New Auto-Interp
Negative Logits
ãĥ¼ãĥĨãĤ£
-0.91
ãĥ¼ãĥĨ
-0.78
artz
-0.76
psons
-0.72
isers
-0.71
ãĥİ
-0.69
ocratic
-0.65
izers
-0.64
odes
-0.63
plaque
-0.63
POSITIVE LOGITS
nuts
0.89
DOWN
0.86
sit
0.84
mmmm
0.84
there
0.84
CONT
0.81
rss
0.81
cffffcc
0.80
etc
0.78
interesting
0.77
Activations Density 0.015%