INDEX
    Explanations

    evaluative ratings and metrics related to films

    New Auto-Interp
    Negative Logits
    igram
    -0.07
    779
    -0.07
    ooter
    -0.06
    #ga
    -0.06
    ilio
    -0.06
    803
    -0.06
     place
    -0.06
     Skinner
    -0.06
     crown
    -0.06
    394
    -0.06
    POSITIVE LOGITS
    dT
    0.07
    AFX
    0.06
    ارس
    0.06
     Harness
    0.06
     roundup
    0.06
    razione
    0.06
    862
    0.06
    alian
    0.06
    seau
    0.06
    balls
    0.06
    Act Density 0.003%

    No Known Activations