INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
ﻝ
0.74
ﻭ
0.70
minify
0.70
ﺭ
0.69
ﻉ
0.69
mostrar
0.68
perone
0.66
joten
0.66
pediu
0.66
kiu
0.66
POSITIVE LOGITS
ق
0.76
лно
0.62
zelfde
0.57
kiej
0.57
ר
0.56
лни
0.55
ح
0.53
ו
0.52
ע
0.52
zione
0.52
Activations Density 0.000%
No Known Activations
This feature has no known activations.