INDEX
Explanations
the phrase "take it."
the repeated reference to the pronoun "it."
New Auto-Interp
Negative Logits
idth
-0.69
CHO
-0.64
avage
-0.63
"],"
-0.62
ãĥ»
-0.61
Wanted
-0.61
Passenger
-0.59
hips
-0.59
IST
-0.58
Column
-0.57
POSITIVE LOGITS
alian
1.06
self
0.94
chy
0.92
unes
0.89
iner
0.89
asca
0.85
ueller
0.79
raining
0.73
geist
0.73
MpServer
0.71
Activations Density 0.112%