INDEX
Explanations
phrases indicating ongoing involvement or experience
New Auto-Interp
Negative Logits
Inherits
-0.16
maal
-0.15
izer
-0.15
ÏĦÎŃ
-0.15
lingen
-0.15
IGIN
-0.15
talented
-0.15
Bec
-0.14
erton
-0.14
Mour
-0.14
POSITIVE LOGITS
around
0.19
Around
0.18
Around
0.18
fixtures
0.18
fixture
0.17
around
0.16
providing
0.15
fixture
0.15
integral
0.15
antics
0.15
Activations Density 0.049%