INDEX
Explanations
proper nouns or specific names of entities
specific named entities or proper nouns
New Auto-Interp
Negative Logits
ĸļ
-0.93
enegger
-0.81
rawdownloadcloneembedreportprint
-0.77
hement
-0.72
anguage
-0.71
emonium
-0.67
Panzer
-0.67
itent
-0.66
mble
-0.66
eers
-0.65
POSITIVE LOGITS
½
0.80
Share
0.69
·
0.69
¶
0.69
«
0.69
¿
0.68
³
0.67
Ĵ
0.66
µ
0.66
ģ
0.65
Activations Density 0.377%