INDEX
Explanations
phrases indicating a contrast or continuation of thought
the phrase "there" and its variations, indicating the presence of statements about existence or conditions
New Auto-Interp
Negative Logits
ointed
-0.64
ãĥĺ
-0.63
comprom
-0.60
shoot
-0.60
transfer
-0.59
rient
-0.57
Finish
-0.57
insert
-0.56
ONSORED
-0.56
Seym
-0.56
POSITIVE LOGITS
are
1.24
exists
1.21
aren
1.15
seems
1.11
's
1.11
appears
1.10
ARE
1.07
isn
1.06
is
1.06
exist
1.03
Activations Density 0.096%