INDEX
Explanations
present tense verbs and associated sentence structures
New Auto-Interp
Negative Logits
would
-0.55
-
-0.53
we
-0.48
line
-0.48
I
-0.48
’
-0.47
were
-0.47
weren
-0.46
Mc
-0.46
”
-0.45
POSITIVE LOGITS
itſelf
1.08
Forumite
0.96
ſmall
0.96
thiệu
0.95
whoſe
0.93
becauſe
0.91
Theſe
0.91
InjectAttribute
0.91
himſelf
0.90
NameInMap
0.89
Activations Density 0.082%