INDEX
    Explanations

    questions or queries

    New Auto-Interp
    Negative Logits
     celebr
    -0.62
     proud
    -0.62
     Khe
    -0.59
     analges
    -0.58
     inclusion
    -0.58
     migration
    -0.58
     contag
    -0.58
     invisible
    -0.57
     contagious
    -0.57
     belonging
    -0.57
    POSITIVE LOGITS
    Answer
    1.38
    Well
    1.08
    ³³³³
    0.99
    Absolutely
    0.97
    Yes
    0.95
    ³³³³³³³³³³³³³³³³
    0.91
    Honestly
    0.91
    Correct
    0.88
    ³³³³³³³³
    0.88
    Probably
    0.87
    Act Density 0.136%

    No Known Activations