INDEX
Explanations
prepositions indicating location or position
New Auto-Interp
Negative Logits
eydi
-0.19
Záp
-0.17
egin
-0.17
ishi
-0.15
sembler
-0.15
vais
-0.15
berapa
-0.15
aggio
-0.15
ponse
-0.15
folio
-0.14
POSITIVE LOGITS
inclusion
0.18
brit
0.18
case
0.18
812
0.18
817
0.18
728
0.17
reality
0.17
addition
0.17
Addition
0.17
676
0.17
Activations Density 0.027%