INDEX
Explanations
questions or statements involving the phrase "in the first place."
the phrase "in the first place."
Negative Logits
Token     Logit
cest      -0.68
raltar    -0.68
dr        -0.65
inal      -0.65
iami      -0.62
mens      -0.61
Doors     -0.61
Ranch     -0.60
STD       -0.59
Barney    -0.59
Positive Logits
Token          Logit
ãĢĤ            0.73
¶              0.71
DonaldTrump    0.67
whatsoever     0.66
.              0.65
lihood         0.65
.,             0.65
!.             0.64
!              0.64
forth          0.64
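
The top positive token, ãĢĤ, looks like the printable stand-in that byte-level BPE vocabularies use for raw bytes rather than corrupted text. The sketch below assumes (the page does not say so) that the tokens come from a GPT-2-style byte-level vocabulary, and rebuilds that byte-to-character table to turn a token string back into readable text. The names gpt2_bytes_to_unicode and decode_token are illustrative, not part of any dashboard API.

```python
def gpt2_bytes_to_unicode():
    """Rebuild the GPT-2-style table mapping each byte (0-255) to a printable character."""
    # Printable bytes map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is assigned the next code point starting at 256.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return {b: chr(c) for b, c in zip(bs, cs)}


# Inverse table: printable stand-in character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in gpt2_bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Map a vocabulary string back to bytes, then decode the bytes as UTF-8."""
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    # A token may hold only part of a multi-byte character, so do not fail on it.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    print(decode_token("ãĢĤ"))
```

Under that assumption, ãĢĤ corresponds to the bytes E3 80 82, which is the ideographic full stop (U+3002). That would fit the other sentence-ending punctuation among the positive logits.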
Activations Density 0.020%