INDEX
Explanations
phrases indicating intentional actions taken for a specific purpose or result
the word "just" in various contexts
New Auto-Interp
Negative Logits
ership
-0.62
lore
-0.60
eers
-0.60
aging
-0.59
apolis
-0.58
azor
-0.57
essor
-0.56
contention
-0.56
pora
-0.55
Archdemon
-0.55
POSITIVE LOGITS
ifications
1.12
ifiable
1.05
itia
0.89
if
0.85
desserts
0.82
IFIED
0.81
ices
0.79
inian
0.78
IFIC
0.76
FY
0.75
Activations Density 0.071%