INDEX
Explanations
specific characters or symbols
punctuation marks and formatting in the text
Negative Logits
Token        Logit
orbiting     -0.75
zbollah      -0.74
apons        -0.73
disapprove   -0.72
disappro     -0.71
phia         -0.71
Saiyan       -0.71
contempl     -0.70
droid        -0.69
ushima       -0.68
Positive Logits
Token        Logit
READ         0.99
CBC          0.97
gary         0.84
Adding       0.83
However      0.82
Cr           0.82
Some         0.82
Mont         0.81
Statistics   0.81
Those        0.81
Activations Density: 0.381%