INDEX
Explanations
phrases that denote opportunities or possibilities for action
New Auto-Interp
Negative Logits
atoon
-0.75
ileaks
-0.69
Logged
-0.65
Zip
-0.64
indle
-0.63
spills
-0.62
Appalach
-0.61
attm
-0.59
iba
-0.59
irit
-0.58
POSITIVE LOGITS
choice
0.97
opportunity
0.89
permission
0.88
chance
0.85
choice
0.79
treatment
0.77
ration
0.75
licence
0.72
otomy
0.72
latitude
0.71
Activations Density 0.070%