INDEX
Explanations
the word 'only' followed by a number from 1 to 10
the word "only" or phrases indicating exclusivity
Negative Logits
idon         -0.80
insula       -0.68
mass         -0.64
actionDate   -0.64
MAP          -0.63
align        -0.62
senal        -0.61
Dynamics     -0.61
iths         -0.60
gnu          -0.60
POSITIVE LOGITS
thing        0.82
marginally   0.80
drawback     0.77
logical      0.74
benef        0.73
recourse     0.73
valid        0.69
half         0.69
consolation  0.68
surviving    0.68
Activations Density: 0.045%