INDEX
Explanations
verbs and related terms associated with instruction, construction, and infection
New Auto-Interp
Negative Logits
ed
-0.23
ISTIC
-0.19
ftar
-0.18
edBy
-0.17
ALLY
-0.16
istically
-0.16
atoria
-0.16
ality
-0.16
neys
-0.15
awaiter
-0.15
POSITIVE LOGITS
ive
0.52
ors
0.48
ivity
0.44
ively
0.44
IVE
0.37
ives
0.37
iveness
0.35
ible
0.31
ORS
0.30
ivo
0.29
Activations Density 0.084%