INDEX
Explanations
religious and historical entities
New Auto-Interp
Negative Logits
ALL
-0.99
but
-0.99
may
-0.98
creating
-0.97
any
-0.93
يسة
-0.93
out
-0.91
or
-0.89
Any
-0.89
ONLY
-0.89
POSITIVE LOGITS
pageX
1.12
GUIDELINES
0.96
ย
0.95
actuel
0.95
particularly
0.95
nouve
0.94
ıç
0.94
✵
0.93
OFFERS
0.92
actually
0.91
Activations Density 0.010%