INDEX
Explanations
the word "just" and its variations in context
Negative Logits
ations     -0.66
avorite    -0.66
ophical    -0.64
atable     -0.64
isu        -0.62
ティ       -0.61
Gate       -0.61
arest      -0.60
itivity    -0.60
Kin        -0.60
Positive Logits
bes        0.78
weeks      0.71
plug       0.68
months     0.68
hours      0.66
bet        0.65
per        0.65
ror        0.64
rebound    0.62
days       0.61
Activations Density: 0.032%