INDEX
Explanations
phrases related to official announcements or statements
pronouns and auxiliary verbs indicating subjects and actions in sentences
New Auto-Interp
Negative Logits
extant
-0.62
Federation
-0.61
Parenthood
-0.59
taining
-0.58
vable
-0.56
Tenth
-0.56
bachelor
-0.56
Ivy
-0.56
Bachelor
-0.56
Fine
-0.54
POSITIVE LOGITS
invariably
1.23
immediately
1.13
usually
1.02
inevitably
1.02
instantly
0.97
typically
0.94
often
0.92
Enlarge
0.90
mediately
0.87
greeted
0.86
Activations Density 0.236%