INDEX
Explanations
the phrase "the only thing" followed by a noun or gerund
repeated references to "the only thing" or similar phrases that emphasize singular importance or focus
New Auto-Interp
Negative Logits
ãĤ´ãĥ³
-0.80
baugh
-0.79
blast
-0.74
oufl
-0.70
fixme
-0.69
har
-0.66
println
-0.66
nec
-0.66
holm
-0.66
xtap
-0.65
POSITIVE LOGITS
that
0.98
happening
0.87
we
0.84
separating
0.79
you
0.79
THAT
0.79
bothering
0.75
they
0.74
preventing
0.74
I
0.73
Activations Density 0.061%