INDEX
Explanations
phrases with the word "figured" followed by a number
instances of the phrase "I figured" indicating personal realizations or thoughts
Negative Logits
idium       -0.72
pes         -0.72
Real        -0.69
vette       -0.67
ewitness    -0.65
apes        -0.65
reports     -0.65
kw          -0.65
Machina     -0.63
perty       -0.63
Positive Logits
sonian          0.92
prominently     0.83
istically       0.80
Rasmussen       0.67
Ducks           0.65
underdog        0.63
strategically   0.63
utory           0.62
inery           0.62
Haram           0.61
Activations Density: 0.026%