INDEX
Explanations
allergies and medical conditions
New Auto-Interp
Negative Logits
an
1.12
the
1.06
ing
0.99
ان
0.98
ும்
0.92
يج
0.88
۔
0.87
Α
0.86
ில்
0.82
ة
0.81
POSITIVE LOGITS
↵↵
1.10
to
0.92
-
0.92
↵
0.89
C
0.85
B
0.85
ys
0.84
8
0.82
P
0.80
ien
0.80
Activations Density 0.002%