INDEX
Explanations
terms relating to highlighted or emphasized text
instances of an unusual special character or symbol
Negative Logits
orts     -0.70
level    -0.66
eking    -0.63
erest    -0.62
ji       -0.62
zel      -0.61
zhou     -0.60
ega      -0.59
pei      -0.58
iple     -0.58
Positive Logits
namely          1.23
perhaps         1.22
albeit          1.11
————            1.02
especially      1.01
particularly    1.01
something       0.98
––              0.96
maybe           0.95
————————        0.95
Activations Density: 0.164%