INDEX
Explanations
religious entities and places
Negative Logits
active    0.68
F         0.66
۲         0.63
I         0.63
carbons   0.62
food      0.61
B         0.60
中        0.60
Music     0.59
२         0.59
Positive Logits
ع         0.80
ح         0.68
م         0.66
ق         0.66
ra        0.64
{         0.64
lla       0.61
on        0.60
li        0.59
Figura    0.59
Activations Density 0.000%