INDEX
Explanations
punctuation and various symbols used in text
New Auto-Interp
Negative Logits
myſelf
-0.62
ſta
-0.59
indígen
-0.59
miniaturka
-0.56
Perſ
-0.56
estadounid
-0.54
ArgsConstructor
-0.54
Reſ
-0.54
pulseira
-0.52
hırka
-0.51
POSITIVE LOGITS
However
0.56
Also
0.54
Others
0.54
Their
0.54
Finally
0.53
They
0.51
Lastly
0.50
Similarly
0.50
Meanwhile
0.50
Those
0.49
Activations Density 1.307%