INDEX
Explanations
adjectives and verbs that express affirmation or agreement
sentences expressing opinions and emotions
New Auto-Interp
Negative Logits
});
-0.73
furthermore
-0.67
moreover
-0.62
estamp
-0.62
Additionally
-0.60
mentioned
-0.60
idon
-0.59
breaking
-0.58
ï¸ı
-0.57
recognizes
-0.57
POSITIVE LOGITS
merely
0.92
purely
0.89
elsewhere
0.77
mere
0.75
relegated
0.75
concentrate
0.74
passively
0.71
simply
0.71
concentrated
0.71
obscurity
0.71
Activations Density 1.102%