INDEX
Explanations
references to medical terminology or treatment methods
Negative Logits
GenerationType    -1.00
المعيارى    -0.95
myſelf    -0.95
་་    -0.93
Jefus    -0.91
pleaſure    -0.90
endphp    -0.89
doubtnut    -0.87
Monfieur    -0.87
uſed    -0.86
Positive Logits
,    0.56
)    0.56
tr    0.56
e    0.55
...)    0.55
er    0.55
os    0.54
<bos>    0.54
are    0.53
(    0.53
Activations Density: 0.438%