Explanations
numerical values and technical jargon
elements related to technical specifications and numerical data
Negative Logits
Token    Logit
gow      -0.66
Toledo   -0.65
wer      -0.64
Kad      -0.63
Elk      -0.63
Tant     -0.63
loo      -0.62
eer      -0.61
haw      -0.60
Haku     -0.59
Positive Logits
Token    Logit
1        1.38
1        1.11
ãĥĺãĥ©   1.05
2        0.94
ĥ        0.83
1001     0.76
ãĤ¹      0.76
ania     0.76
½        0.76
½        0.75
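Several of the positive-logit tokens (for example ãĥĺãĥ© and ãĤ¹) look like strings from a byte-level BPE vocabulary, where every raw byte is displayed as a stand-in printable character. The page does not say which tokenizer the model uses, so the following is only a minimal sketch that assumes the GPT-2 style byte-to-character table; the helper name `decode_token` is hypothetical and not part of any dashboard API.

```python
def bytes_to_unicode():
    """Build the GPT-2 style table mapping each byte (0-255) to a printable character."""
    # Bytes that are already printable keep their own code point.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    shift = 0
    # Every remaining byte is assigned the next free code point from U+0100 upward.
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + shift)
            shift += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


# Inverse table: display character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Hypothetical helper: turn a byte-level token string back into readable text.

    Assumes a GPT-2 style byte-level vocabulary. Tokens that hold only part of a
    multi-byte character decode to the Unicode replacement character.
    """
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


print(decode_token("ãĥĺãĥ©"))  # ヘラ
print(decode_token("ãĤ¹"))      # ス
print(decode_token("ania"))     # ania
```

Under that assumption, ãĥĺãĥ© and ãĤ¹ are katakana fragments, while single-character entries such as ĥ stand for one byte of a multi-byte character and cannot be decoded on their own.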
Activations Density 0.221%