    Explanations

    references to Indian films and filmmakers

    Negative Logits
     myſelf      -1.62
     itſelf      -1.52
     ſind        -1.51
     houſe       -1.50
     Efq         -1.49
     raiſ        -1.43
     iſt         -1.38
     ſche        -1.37
     faſt        -1.37
     ſever       -1.36

    Positive Logits
     k           0.75
    (blank)      0.74
     ne          0.74
    ...          0.74
     v           0.72
    (blank)      0.72
     N           0.71
     O           0.69
     n           0.69
    ,            0.69
    Activation Density 0.020%

    No Known Activations