INDEX
Explanations
variations of the term 'acceptable'
New Auto-Interp
Negative Logits
eling
-0.21
pper
-0.16
lom
-0.15
OME
-0.14
uell
-0.14
xy
-0.14
ków
-0.14
ç¦
-0.14
elled
-0.14
590
-0.14
POSITIVE LOGITS
ably
0.20
Boeh
0.19
hay
0.19
uku
0.15
ately
0.15
ivant
0.15
Nolan
0.14
folios
0.14
sufficient
0.14
afen
0.14
Activations Density 0.017%