INDEX
Explanations
characters or symbols that indicate transitions or prepositions
prefix "pre" or "fore"
New Auto-Interp
Negative Logits
asteroide
-0.48
TagMode
-0.47
Spoljašnje
-0.46
Hentet
-0.46
pantalón
-0.46
chaleco
-0.45
nahilalakip
-0.44
aikaa
-0.44
cerely
-0.42
kurtka
-0.41
POSITIVE LOGITS
Пред
0.55
Пред
0.55
Pref
0.54
Pron
0.50
pré
0.50
пред
0.49
Προ
0.48
beforeEach
0.48
Pred
0.46
pref
0.46
Activations Density 0.002%