INDEX
Explanations
conjunctions and commas
punctuation and conjunctions that indicate a contrast or connection in ideas
Negative Logits
Token          Logit
tained         -0.55
venth          -0.55
aughtered      -0.53
guiActiveUn    -0.52
acket          -0.52
ulla           -0.50
ÂŃ             -0.48
Tours          -0.48
Partnership    -0.48
elfth          -0.47
Positive Logits
Token          Logit
huh            1.08
etc            0.93
please         0.91
eh             0.91
yeah           0.88
dont           0.87
doesnt         0.85
preferably     0.80
but            0.76
tho            0.75
Activations Density 0.415%