INDEX
Explanations
personal pronouns followed by modal verbs, indicating possibility or permission
references to people and their potential actions or capabilities
Negative Logits
Token        Logit
sqor         -0.81
Cosponsors   -0.74
awei         -0.61
municip      -0.60
Reviewer     -0.58
partName     -0.55
yet          -0.55
millenn      -0.55
idav         -0.54
juven        -0.53
Positive Logits
Token        Logit
can          1.18
wont         1.04
dont         0.99
wouldn       0.97
can          0.91
won          0.89
could        0.89
don          0.88
doesnt       0.87
CAN          0.84
Activations Density: 0.199%