INDEX
    Explanations

    names and roles of actors in movies

    New Auto-Interp
    Negative Logits
    urs
    -0.15
    ur
    -0.15
    oku
    -0.14
    .Magenta
    -0.14
    ugin
    -0.14
    ãĥ³ãĥĩãĤ£
    -0.14
    urge
    -0.14
    okus
    -0.14
    ardown
    -0.14
    incerely
    -0.13
    POSITIVE LOGITS
     pii
    0.17
     fov
    0.15
    .tom
    0.14
    AML
    0.14
     famously
    0.14
    orman
    0.14
    Touches
    0.14
    herits
    0.14
     verbal
    0.13
    #ad
    0.13
    Act Density 0.123%

    No Known Activations