INDEX
    Explanations

    the beginning of sentences or paragraphs

    Preceding or containing a number

    New Auto-Interp
    Negative Logits
     مرئيه
    -0.67
    )».
    -0.63
     NavController
    -0.62
    ()',
    -0.62
     ☐
    -0.60
    )");
    
    -0.59
    .")]
    -0.59
    \",\
    -0.58
    ).]
    -0.58
    .';
    -0.57
    POSITIVE LOGITS
    
    1.61
     Bartholomew
    0.69
     Viceroy
    0.66
     Broome
    0.65
     Jamestown
    0.64
     Juneau
    0.63
     Jin
    0.62
     Herschel
    0.62
     Bernadette
    0.62
    RegressionTest
    0.61
    Act Density 0.039%

    No Known Activations