INDEX
Explanations
references to similarity or equivalence in contexts discussing amounts, measures, and conditions
Negative Logits
ng        -0.16
ebi       -0.15
öl        -0.15
anzi      -0.15
ibi       -0.15
รม        -0.15
inning    -0.15
ka        -0.14
à¥įसर     -0.14
#Region   -0.14
Positive Logits
nock      0.21
ukkan     0.18
214       0.16
iah       0.14
raig      0.14
Trav      0.14
JK        0.14
anford    0.14
饰        0.14
nisi      0.13
Activations Density: 0.149%