INDEX
Explanations
texts related to technical specifications and manufacturing details
acronyms and initialisms related to organizations or political entities
Negative Logits
selves        -0.56
oneself       -0.49
.):           -0.49
infringing    -0.49
Attribution   -0.45
anyway        -0.45
emphasis      -0.44
anew          -0.43
academ        -0.43
:[            -0.42
Positive Logits
has           1.25
was           1.23
became        1.20
gave          1.20
took          1.19
went          1.16
showed        1.15
appeared      1.14
seemed        1.14
began         1.13
Activations Density: 1.058%