INDEX
Explanations
topics related to community organization and religious leadership
New Auto-Interp
Negative Logits
baptized
-0.16
anse
-0.16
Amit
-0.16
ellen
-0.16
agal
-0.15
ulti
-0.15
Trouble
-0.15
Trou
-0.15
lys
-0.15
Krish
-0.14
POSITIVE LOGITS
etiqu
0.22
tas
0.19
juris
0.18
Tas
0.18
Alla
0.18
Ras
0.17
narr
0.17
'gc
0.17
compan
0.17
Shay
0.17
Activations Density 0.189%