INDEX
Explanations
instances of assertion and emphasis in argumentation
New Auto-Interp
Negative Logits
Ìģc
-0.17
exus
-0.15
ÑĪив
-0.15
urnished
-0.14
odox
-0.14
Parkinson
-0.14
DataExchange
-0.14
DSL
-0.13
rophe
-0.13
ewood
-0.13
POSITIVE LOGITS
ãĥ³ãĥĶ
0.14
Wnd
0.14
_featured
0.14
ampa
0.14
izza
0.14
sic
0.14
_INSTANCE
0.14
ãĥ§
0.13
ARGET
0.13
uchos
0.13
Activations Density 0.042%