INDEX
Explanations
phrases related to potential actions, preferences, or choices
New Auto-Interp
Negative Logits
pyplot
-0.54
Figura
-0.54
sext
-0.52
واج
-0.50
ostock
-0.50
zur
-0.49
Sext
-0.48
yarar
-0.47
göre
-0.47
Awak
-0.46
POSITIVE LOGITS
<bos>
1.20
hoeddwyd
0.88
featureID
0.88
الإنجليزية
0.87
unknownFields
0.85
ModelExpression
0.81
tvguidetime
0.80
IsMutable
0.78
fjspx
0.78
+#+
0.77
Activations Density 0.078%