INDEX
Explanations
proper nouns
proper names or significant titles within the text
New Auto-Interp
Negative Logits
âĢº
-0.90
Ying
-0.73
Hubble
-0.71
waterfall
-0.68
adop
-0.67
Roku
-0.67
ãĢĮ
-0.66
chart
-0.65
âĢ
-0.65
Recomm
-0.64
POSITIVE LOGITS
ulz
2.61
ooter
1.78
Emma
1.54
ooters
1.44
masked
1.34
ooting
1.31
nikov
1.15
olver
1.01
kinson
1.01
lag
0.98
Activations Density 0.027%