Explanations
specific symbols or fragments in programming and technical terms
Negative Logits
velt     -0.19
feld     -0.19
fabs     -0.16
fila     -0.16
frau     -0.16
folk     -0.16
felt     -0.16
fil      -0.15
fur      -0.15
facts    -0.15
Positive Logits
ront     0.40
eature   0.38
eatures  0.37
rame     0.36
ield     0.36
orce     0.36
irst     0.36
eed      0.35
rames    0.35
amily    0.34
Activations Density: 0.088%