Explanations
key conjunctions and relational language in the text
Negative Logits
Token          Logit
vp             -0.14
imon           -0.14
.Diagnostics   -0.14
ãĥ¼ãĥĦ         -0.14
entry          -0.14
angu           -0.13
Entry          -0.13
ative          -0.13
Natural        -0.13
ephy           -0.13
Positive Logits
Token          Logit
ment           0.16
VIS            0.16
Visibility     0.16
alent          0.15
Vis            0.15
viscosity      0.15
VIS            0.15
ÑĥÑĪ           0.14
hy             0.14
hybrid         0.14
Activations Density: 0.026%