INDEX
Explanations
instances where the phrase "I just" is used
the repeated use of the word "just."
New Auto-Interp
Negative Logits
luster
-0.67
antage
-0.65
PLUS
-0.65
ikuman
-0.62
Pett
-0.62
adversary
-0.62
Prelude
-0.61
cous
-0.60
antis
-0.60
oro
-0.59
POSITIVE LOGITS
ifications
1.12
ifiable
1.05
if
0.91
gotta
0.90
ifi
0.88
ified
0.83
IFIED
0.80
kidding
0.78
itia
0.77
wanna
0.77
Activations Density 0.074%