INDEX
Explanations
words associated with brightness or radiance
Negative Logits
ãĤ¢ãĥ¼       -0.17
ionale       -0.17
ohn          -0.16
oleon        -0.16
timeofday    -0.16
odon         -0.15
othy         -0.14
æ¤           -0.14
reno         -0.14
ÑĴ           -0.14
Positive Logits
atform       0.16
elpers       0.15
reeze        0.14
bat          0.14
inecraft     0.14
bit          0.14
@show        0.14
amet         0.14
ved          0.13
521          0.13
Activations Density 0.011%