INDEX
Explanations
verbs indicating strong emotions or personal perspectives
forms of the verb "to be" in various tenses
Negative Logits
Previous    -0.54
inav        -0.54
atives      -0.53
Which       -0.51
Benefits    -0.51
stop        -0.51
urry        -0.49
IMAGES      -0.49
Moines      -0.48
osate       -0.47
Positive Logits
nt          0.89
supposed    0.83
definitely  0.78
senal       0.77
not         0.73
indeed      0.72
unlikely    0.71
rael        0.71
always      0.71
able        0.71
Activations Density: 0.830%