Explanations
instances of something happening for the first time
instances of the phrase "for the first time."
Negative Logits
Token     Logit
cart      -0.58
#$        -0.58
burn      -0.58
_.        -0.56
doom      -0.56
rend      -0.55
milo      -0.55
md        -0.54
bos       -0.54
killer    -0.54
Positive Logits
Token          Logit
time           1.42
time           1.01
times          0.90
TIME           0.88
foreseeable    0.87
Time           0.80
decade         0.80
semester       0.79
TIME           0.78
instance       0.78
Activations Density: 0.040%