INDEX
Explanations
phrases or names containing "le" followed by a number
instances of the word "le" in different contexts
New Auto-Interp
Negative Logits
rador
-0.99
Seym
-0.99
raints
-0.97
yrinth
-0.91
iaries
-0.89
ĸļ
-0.88
ividual
-0.86
carbohyd
-0.85
raint
-0.85
inelli
-0.83
POSITIVE LOGITS
ttes
1.20
lla
1.17
agues
1.16
phant
0.96
isure
0.96
utenant
0.89
asure
0.87
lean
0.86
eping
0.85
phrine
0.85
Activations Density 0.026%