INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
    bass
    -0.67
    Balt
    -0.65
    inished
    -0.64
    ãĥ£
    -0.63
    Nor
    -0.63
    Pur
    -0.62
    pite
    -0.61
    bows
    -0.60
    parts
    -0.58
    ters
    -0.57
    POSITIVE LOGITS
     Allaah
    0.81
     Mandela
    0.81
     Rowling
    0.76
    croft
    0.75
     Ivanka
    0.75
     Isis
    0.73
     Canaver
    0.73
     Ney
    0.71
     Manziel
    0.71
     Christensen
    0.68
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.