INDEX
Explanations
words related to medical treatments and conditions
New Auto-Interp
Negative Logits
แ
-0.66
்கள்
-0.64
ையும்
-0.62
magát
-0.62
égek
-0.61
္
-0.58
ząca
-0.58
TO
-0.58
ും
-0.57
طيع
-0.57
POSITIVE LOGITS
itſelf
1.11
iſt
1.02
occafion
1.01
ProtoMessage
1.00
ſever
0.98
ſind
0.96
Efq
0.96
myſelf
0.94
متعلقه
0.93
raiſ
0.93
Activations Density 1.129%