INDEX
Explanations
phrases indicating location or position in relation to other elements
New Auto-Interp
Negative Logits
ujednoznacz
-0.74
gyű
-0.57
manna
-0.57
brancas
-0.57
kasarigan
-0.57
cascada
-0.57
Lingkungan
-0.56
ValueStyle
-0.56
mijne
-0.55
vrijwilli
-0.55
POSITIVE LOGITS
انگلیسی
0.71
Nev
0.71
Biel
0.67
ән
0.67
日在
0.66
Bres
0.64
قایناقلار
0.63
chó
0.62
OrBuilder
0.62
anin
0.62
Activations Density 0.035%