INDEX
Explanations
phrases related to safety and medical guidelines
New Auto-Interp
Negative Logits
ValueStyle
-0.59
SerializedName
-0.53
tweaked
-0.53
pretty
-0.52
anskje
-0.52
Thankfully
-0.51
Interestingly
-0.51
just
-0.51
arguably
-0.51
hopefully
-0.50
POSITIVE LOGITS
الرياضيه
0.72
ſelf
0.69
متعلقه
0.67
poichè
0.64
Never
0.63
Consult
0.61
Moslem
0.61
ेशा
0.60
awsze
0.60
ATTENTION
0.60
Activations Density 0.170%