INDEX
Explanations
questions about identity or responsibility
subjects of inquiry or investigation expressed through the word "who."
New Auto-Interp
Negative Logits
framework
-0.77
requ
-0.69
ooters
-0.67
stantial
-0.64
irth
-0.63
Candle
-0.63
vity
-0.63
naissance
-0.62
externalToEVAOnly
-0.62
Supplemental
-0.61
POSITIVE LOGITS
infiltrated
0.82
benefited
0.80
afort
0.80
whom
0.78
inhab
0.73
footing
0.73
owns
0.72
deceived
0.71
orchestr
0.70
filib
0.69
Activations Density 0.082%