INDEX
Explanations
Mathematical equations or expressions involving equalities and inequalities
Negative Logits
vocês         -0.68
ſeveral       -0.67
kirch         -0.63
Chwiliwch     -0.63
ustedes       -0.60
neſs          -0.60
themſelves    -0.57
ſever         -0.55
foro          -0.55
ti            -0.53
Positive Logits
>=</          1.83
/=            1.75
=             1.59
$=\           1.44
}=            1.41
.=            1.39
$=            1.36
)=            1.35
}=\           1.35
=\            1.35
Activations Density 1.553%