INDEX
Explanations
instances of the word "it" followed by a verb
phrases that discuss the conditionality or alternatives in various situations
New Auto-Interp
Negative Logits
Ratio
-0.65
Disclosure
-0.62
;;;;;;;;;;;;
-0.60
Syndrome
-0.58
Irwin
-0.58
Amen
-0.57
uda
-0.56
Rating
-0.56
ilib
-0.56
Contents
-0.56
POSITIVE LOGITS
oots
0.73
vg
0.68
wisely
0.67
consciously
0.66
theless
0.64
rha
0.63
ween
0.62
advoc
0.60
intentional
0.59
racuse
0.59
Activations Density 0.094%