INDEX
Explanations
phrases that indicate comparison or opposition
words and phrases indicating measurement or comparison
New Auto-Interp
Negative Logits
owder
-0.67
achev
-0.67
%%
-0.65
utenberg
-0.62
Started
-0.60
Synopsis
-0.60
sugg
-0.60
hemer
-0.59
CVE
-0.58
Material
-0.58
POSITIVE LOGITS
elsewhere
1.17
others
1.14
other
1.08
neighbouring
0.97
Others
0.93
ours
0.90
Others
0.86
neighboring
0.86
other
0.83
Other
0.81
Activations Density 1.459%