INDEX
Explanations
contractions with the letter 's
the phrase "It’s" or variations thereof that indicate a statement or observation
New Auto-Interp
Negative Logits
ESE
-0.79
ren
-0.72
igraph
-0.69
nel
-0.69
scope
-0.68
ollow
-0.67
%%
-0.67
DS
-0.67
stad
-0.66
umbnail
-0.66
POSITIVE LOGITS
gonna
1.16
unclear
1.15
gotta
1.15
been
1.08
impossible
1.03
doubtful
1.01
worth
1.01
easy
0.99
gotten
0.97
got
0.95
Activations Density 0.075%