Explanations
past tense verbs that imply an action or decision
Negative Logits
ById          -0.68
ateg          -0.67
gart          -0.66
mong          -0.66
usat          -0.64
WER           -0.63
cknowled      -0.62
vell          -0.62
Cosponsors    -0.61
HCR           -0.61
Positive Logits
him           0.68
them          0.62
Lanc          0.61
Hitman        0.61
Wiz           0.60
Stamford      0.59
THEM          0.58
join          0.58
],"           0.57
Velvet        0.57
Activations Density 0.190%