INDEX
Explanations
fragments related to quotations
rhetorical questions and expressions of opinion
New Auto-Interp
Negative Logits
abouts
-0.94
arij
-0.77
aldi
-0.77
intended
-0.75
isons
-0.75
oval
-0.74
herself
-0.73
racuse
-0.70
nesday
-0.70
allas
-0.69
POSITIVE LOGITS
Until
0.95
HAHAHAHA
0.94
Seriously
0.93
Especially
0.86
Anyway
0.85
Lear
0.85
And
0.84
Sure
0.84
Advertisements
0.83
Norm
0.82
Activations Density 0.368%