INDEX
Explanations
names of individuals
entities associated with specific actions or individuals
New Auto-Interp
Negative Logits
ASA
-0.88
paras
-0.79
ASP
-0.78
Sim
-0.78
Tib
-0.77
Sob
-0.75
impedance
-0.75
ãĥ³ãĤ¸
-0.74
acron
-0.74
Symb
-0.72
POSITIVE LOGITS
ll
1.67
LL
1.35
ill
1.28
ell
1.28
ELL
1.21
oll
1.12
yll
1.12
ILL
1.12
rell
1.11
illy
1.10
Activations Density 0.219%