INDEX
Explanations
phrases indicating deep contemplation or significant thought about a specific subject
references to "it" in various contexts of decision-making or thoughts
Negative Logits
agogue       -0.61
Vehicle      -0.61
abee         -0.57
Plate        -0.57
arthed       -0.57
Firm         -0.56
oided        -0.56
dp           -0.55
Colo         -0.55
quartered    -0.54
Positive Logits
chy           1.22
happening     1.06
alian         1.01
self          0.93
iner          0.87
unes          0.85
myself        0.81
afterwards    0.81
happen        0.79
displayText   0.79
Activations Density: 0.120%