INDEX
Explanations
phrases emphasizing a specific location or point in a discussion
instances of the word "here" used to emphasize a point or clarify a detail in context
Negative Logits
parap      -0.66
ews        -0.64
Gujar      -0.60
Mehran     -0.59
Doors      -0.59
visors     -0.58
ggle       -0.58
tongues    -0.56
uously     -0.55
amaz       -0.54
Positive Logits
abouts         1.54
tics           1.54
tical          1.54
tic            1.20
upon           0.81
with           0.80
guiActiveUn    0.78
after          0.73
from           0.70
here           0.69
Activations Density: 0.061%
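
As a rough illustration of what a density figure like the one above usually means, the sketch below computes the fraction of token positions on which a feature fires. It assumes that "density" is the share of positions with activation above zero; the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: a position counts as "active" when its activation is
    strictly greater than threshold (0.0 by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 token positions, 6 of which are active.
sample = [0.0] * 9994 + [0.8, 1.2, 0.3, 2.1, 0.5, 0.9]
density = activation_density(sample)
print(f"{density:.3%}")
```

A density this low would indicate a sparse feature that fires on only a handful of tokens per ten thousand, which is consistent with a feature tied to a narrow set of words.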