INDEX
Explanations
terms related to the presence and properties of substances or materials
as part of a phrase
Negative Logits
🔕    -0.59
تقاوى    -0.57
Autorisations    -0.56
HomeAsUpEnabled    -0.56
ValueStyle    -0.56
WriteBarrier    -0.55
насељу    -0.55
:✨    -0.55
pexpr    -0.53
дописавши    -0.52
Positive Logits
@[+][    0.41
Tikang    0.36
propOrder    0.30
sometimes    0.30
rentiel    0.29
sometimes    0.26
ContentAlignment    0.26
gjerne    0.26
gning    0.26
suiker    0.26
Activations Density 0.044%