INDEX
Explanations
the word "wild" appearing in various contexts
instances of the word "build."
New Auto-Interp
Negative Logits
SOURCE
-0.85
perature
-0.72
MIT
-0.72
displayText
-0.70
kins
-0.68
earchers
-0.68
andise
-0.65
PLAY
-0.64
Jen
-0.63
CHR
-0.63
POSITIVE LOGITS
ild
1.04
erers
0.78
sburg
0.77
rag
0.76
er
0.75
sson
0.75
ness
0.75
reth
0.74
itudinal
0.74
rance
0.73
Activations Density 0.007%