INDEX
Explanations
phrases related to problem-solving or decision-making
occurrences of the word "figure" and its variations
New Auto-Interp
Negative Logits
ibaba
-0.84
00000
-0.74
rem
-0.66
nor
-0.64
rons
-0.63
ewitness
-0.63
VIDEO
-0.63
rary
-0.61
ription
-0.60
riott
-0.60
POSITIVE LOGITS
prominently
0.84
out
0.80
skating
0.80
istically
0.73
omething
0.66
matically
0.65
heads
0.64
ħ
0.62
sonian
0.62
how
0.60
Activations Density 0.033%