INDEX
Explanations
instances where the phrase "the first time that" occurs
New Auto-Interp
Negative Logits
aciously
-0.74
cosystem
-0.64
estern
-0.62
roth
-0.62
Guard
-0.59
usk
-0.58
leeve
-0.58
IDs
-0.58
IVERS
-0.57
orah
-0.57
POSITIVE LOGITS
occurs
0.93
happens
0.90
soever
0.84
they
0.79
occurred
0.79
happened
0.77
arose
0.76
mattered
0.76
transpired
0.71
we
0.70
Activations Density 0.123%