INDEX
Explanations
mentions of numbers or measurements
occurrences of specific punctuation symbols or brackets
New Auto-Interp
Negative Logits
rette
-0.72
ĵĺ
-0.68
aband
-0.67
neigh
-0.66
tones
-0.66
cradle
-0.65
marine
-0.65
etsy
-0.65
ãĤ©
-0.64
reet
-0.64
POSITIVE LOGITS
However
0.96
Additionally
0.91
Furthermore
0.85
Similarly
0.84
Conversely
0.82
Likewise
0.82
Later
0.81
Nevertheless
0.80
Therefore
0.77
Alternatively
0.76
Activations Density 0.050%