INDEX
Explanations
Starts of sentences with names
New Auto-Interp
Negative Logits
га
0.81
х
0.74
htmlspecialchars
0.71
four
0.70
γ
0.68
constants
0.66
the
0.66
postion
0.66
three
0.64
that
0.62
POSITIVE LOGITS
Fiesta
0.70
Putri
0.66
Monate
0.64
Frau
0.63
Gazette
0.63
Hasta
0.63
Woman
0.62
Mama
0.62
↵↵
0.61
Musik
0.61
Activations Density 0.414%