INDEX
Explanations
phrases related to personal relationships and associations
references to individuals' advocacy and support for various social and political causes
New Auto-Interp
Negative Logits
tomorrow
-0.72
?".
-0.63
Impossible
-0.63
?).
-0.63
TOTAL
-0.63
?!"
-0.62
ILLE
-0.61
={-0.61
unus
-0.60
ueless
-0.59
POSITIVE LOGITS
throughout
0.86
despite
0.82
particularly
0.79
stemming
0.76
especially
0.76
insofar
0.75
since
0.74
whom
0.73
evidenced
0.72
especially
0.71
Activations Density 0.446%