Explanations
phrases related to ancient Greek characters or symbols
tokens that are likely technical or scientific symbols and terms
Negative Logits
hower       -0.78
iage        -0.78
ufact       -0.74
bourg       -0.73
management  -0.72
Brow        -0.71
ilage       -0.69
folk        -0.68
ividual     -0.68
Lauder      -0.68
Positive Logits
α           1.82
ο           1.80
Î           1.74
ÏĦ          1.70
ι           1.65
Ï           1.63
κ           1.61
á½          1.57
ε           1.54
ν           1.53
Activations Density: 0.027%