INDEX
Negative Logits
indoctr         0.60
fervor          0.52
kiddos          0.50
instructor      0.48
apparel         0.47
caloric         0.47
instructional   0.45
victimization   0.45
regimen         0.45
instructor      0.44
Positive Logits
Anglican        0.61
Exploring       0.59
carers          0.57
explores        0.56
recognisable    0.56
Exploring       0.55
centenary       0.55
explore         0.55
Diocese         0.55
apologised      0.54
Activations Density: 0.001%