INDEX
Explanations
references to charitable organizations and their activities
New Auto-Interp
Negative Logits
.uml
-0.15
Sist
-0.15
ritch
-0.14
.jackson
-0.14
Vatican
-0.14
VRTX
-0.14
@student
-0.14
TableCell
-0.14
opic
-0.14
nackte
-0.14
POSITIVE LOGITS
Salvation
0.21
Salv
0.21
salv
0.20
corps
0.20
cad
0.19
Cad
0.18
kettle
0.17
.sal
0.17
ens
0.17
Sal
0.17
Activations Density 0.010%