INDEX
Explanations
proper nouns related to current events, such as names of people and places
significant or critical events and situations
Negative Logits
Token     Logit
estim     -0.66
flyers    -0.64
marg      -0.63
ilk       -0.63
iod       -0.61
princ     -0.61
printing  -0.60
é¾į       -0.59
ranch     -0.59
ishi      -0.58
Positive Logits
Token     Logit
02        1.41
01        1.36
03        1.35
00        1.30
04        1.29
05        1.25
06        1.22
07        1.15
Replay    1.13
08        1.13
Activations Density: 0.010%