INDEX
Explanations
instances of the word "light" in different contexts
references to "light" in various contexts
New Auto-Interp
Negative Logits
halla
-0.98
ettings
-0.81
utor
-0.76
berus
-0.73
CVE
-0.68
OUP
-0.68
isner
-0.68
Merrill
-0.67
llah
-0.67
Carnegie
-0.67
POSITIVE LOGITS
weights
1.24
hearted
1.22
bulb
1.20
bul
1.20
ening
1.19
enment
1.14
bulbs
1.05
heartedly
1.05
weight
1.03
ened
1.02
Activations Density 0.028%