INDEX
Explanations
proper nouns and names of individuals or organizations
Negative Logits
respecting    -0.67
selves        -0.67
aging         -0.66
Geral         -0.64
bypass        -0.64
bottleneck    -0.64
cov           -0.63
KM            -0.63
impunity      -0.63
transports    -0.62
Positive Logits
OT            1.36
BS            1.27
AD            1.26
AM            1.26
AMS           1.24
EC            1.24
UR            1.23
ARK           1.23
PS            1.23
OM            1.22
Activations Density: 0.637%