INDEX
    Explanations

    references to scientific research and medical studies

    New Auto-Interp
    Negative Logits
     houſe
    -1.02
     ſtate
    -0.92
     leaſt
    -0.87
     faſt
    -0.86
     purpoſe
    -0.85
     Houſe
    -0.84
     cauſe
    -0.83
     ſmall
    -0.83
     pleaſure
    -0.81
     ſeveral
    -0.81
    POSITIVE LOGITS
     estekak
    0.77
    abestanden
    0.69
    __':
    
    0.68
    rungsseite
    0.68
     استنادى
    0.68
    Hochspringen
    0.67
     "..\..\
    0.65
    Transkript
    0.62
    __':
    0.62
    Einzelnachweise
    0.60
    Act Density 1.482%

    No Known Activations