INDEX
    Explanations

    phrases that contain numbers or measurements

    New Auto-Interp
    Negative Logits
     repug
    -1.07
     disagre
    -0.99
     suspic
    -0.99
     viciss
    -0.99
     Leurs
    -0.98
     pamph
    -0.97
     unwarran
    -0.95
     rodriguez
    -0.94
     Souha
    -0.93
     practition
    -0.92
    POSITIVE LOGITS
     These
    0.74
     Lastly
    0.72
     Finally
    0.71
    .‏
    0.69
    ↵↵
    0.68
     Both
    0.68
     This
    0.66
     Additionally
    0.66
    ConverterFactory
    0.65
    }.
    0.63
    Act Density 0.509%

    No Known Activations