INDEX
    Explanations

    phrases related to taking action or making an effort to achieve something

    New Auto-Interp
    Negative Logits
    <bos>
    -3.28
    <?
    -0.73
    /***
    
    -0.73
    -0.71
    
    
    -0.64
    /*++
    -0.64
    ///**
    -0.63
    //---
    -0.62
    <>
    
    -0.62
     Williams
    -0.60
    POSITIVE LOGITS
     stockholm
    1.37
     maneu
    1.22
     Khart
    1.19
     eiffel
    1.19
     lidl
    1.16
     Keny
    1.15
     emphat
    1.13
     sophie
    1.13
     frankfurt
    1.12
     thut
    1.11
    Act Density 0.193%

    No Known Activations