INDEX
Explanations
discussions about community building and social integration among Muslims
New Auto-Interp
Negative Logits
ellen
-0.17
baptized
-0.17
Trou
-0.16
canh
-0.16
bapt
-0.16
Trouble
-0.15
Krishna
-0.15
Trou
-0.15
Å©
-0.15
Sans
-0.14
POSITIVE LOGITS
tas
0.20
etiqu
0.19
narr
0.18
igham
0.18
Shay
0.17
Ras
0.17
Tas
0.16
mash
0.16
juris
0.16
Mash
0.16
Activations Density 0.264%