INDEX
Explanations
editorial notes within a document
references to editorial notes or comments
New Auto-Interp
Negative Logits
rises
-0.75
pered
-0.73
avid
-0.68
ffff
-0.68
crow
-0.66
Estonia
-0.66
mallow
-0.65
bands
-0.64
Vulcan
-0.64
ought
-0.64
POSITIVE LOGITS
Editor
0.95
Editor
0.87
Editors
0.86
editor
0.81
Clicker
0.81
ially
0.80
ial
0.73
Editorial
0.73
Reader
0.72
editor
0.71
Activations Density 0.009%