INDEX
Explanations
descriptions of actions or events happening
words and phrases related to getting information or sneak peeks
New Auto-Interp
Negative Logits
orrow
-0.76
withdrawing
-0.69
coerc
-0.67
inval
-0.66
Delete
-0.65
unemploy
-0.65
delet
-0.64
constitu
-0.63
bankrupt
-0.63
debts
-0.63
POSITIVE LOGITS
glimpse
1.77
glimps
1.70
peek
1.45
firsthand
1.35
insight
1.25
insider
1.06
taste
1.05
sneak
1.04
preview
1.01
insights
1.00
Activations Density 0.365%