INDEX
Explanations
numerical estimates and quantities
New Auto-Interp
Negative Logits
iban
-0.70
hee
-0.64
olin
-0.61
sson
-0.60
ighthouse
-0.59
cation
-0.59
bis
-0.58
ppo
-0.58
urai
-0.57
ESE
-0.56
POSITIVE LOGITS
upwards
0.94
tens
0.88
thirty
0.82
fifteen
0.82
seventy
0.82
undreds
0.79
anywhere
0.79
sixty
0.78
twenty
0.77
approximately
0.76
Activations Density 0.376%