INDEX
Explanations
occurrences of the word "start" and its variations
New Auto-Interp
Negative Logits
ISE
-0.18
geh
-0.17
ABB
-0.17
abb
-0.15
okino
-0.15
ÑģÑĮ
-0.15
itage
-0.14
rouch
-0.14
itters
-0.14
arel
-0.14
POSITIVE LOGITS
ings
0.22
swith
0.22
ÂŃing
0.21
/end
0.20
seite
0.20
sWith
0.19
ingly
0.19
nings
0.18
ovnÃŃ
0.16
nin
0.16
Activations Density 0.034%