INDEX
Explanations
occurrences of the word "time" with high activation values
phrases that refer to "a time"
Negative Logits
avorite     -0.92
uctions     -0.74
inav        -0.67
ging        -0.67
deals       -0.64
chances     -0.64
Hits        -0.63
ussions     -0.62
emort       -0.60
ipation     -0.59
Positive Logits
frame       0.79
zone        0.78
frames      0.73
女          0.72
neck        0.72
lot         0.70
覚醒        0.70
glass       0.70
pring       0.70
Variable    0.69
Activations Density: 0.019%