INDEX
Explanations
phrases related to religious figures or worship
references to a divine or blessed figure
New Auto-Interp
Negative Logits
mascul
-0.76
sonian
-0.71
ilater
-0.68
parach
-0.66
Parsons
-0.64
CBS
-0.64
hel
-0.63
webs
-0.62
Simpson
-0.61
enegger
-0.61
POSITIVE LOGITS
ãĢIJ
1.01
Sakuya
0.98
Qin
0.93
âĶĢâĶĢ
0.93
Yuan
0.90
ãĢİ
0.88
Qian
0.87
Luo
0.87
Continent
0.86
Xu
0.85
Activations Density 0.759%