INDEX
Negative Logits
Powell
-0.07
-tw
-0.07
unw
-0.07
ognition
-0.07
ew
-0.06
Willow
-0.06
entral
-0.06
game
-0.06
Hall
-0.06
Qual
-0.06
POSITIVE LOGITS
ic
0.19
IC
0.16
mic
0.16
onic
0.14
ovic
0.13
ric
0.13
lic
0.12
vic
0.12
otic
0.12
tic
0.12
Activations Density 0.148%