INDEX
Explanations
phrases indicating contribution or participation to a written report or story
mentions of reports or articles
New Auto-Interp
Negative Logits
journal
-0.64
seiz
-0.60
registers
-0.60
joints
-0.59
drawings
-0.58
journals
-0.58
scrimmage
-0.57
demos
-0.56
sed
-0.55
ingred
-0.55
POSITIVE LOGITS
OSP
0.68
inion
0.68
<|endoftext|>
0.66
loo
0.64
///
0.63
herical
0.63
Pacific
0.62
ranch
0.62
ept
0.61
.</
0.61
Activations Density 0.069%