INDEX
    Explanations

    references to reviews and critical assessments of films

    New Auto-Interp
    Negative Logits
    ьаж
    -0.56
     Infórmanos
    -0.55
    CloseOperation
    -0.55
     houſe
    -0.55
     juſ
    -0.51
    EndContext
    -0.49
    queryInterface
    -0.48
    usermodel
    -0.48
    TagMode
    -0.48
    MVH
    -0.47
    POSITIVE LOGITS
     critic
    0.53
    critic
    0.47
     reviewer
    0.45
     wrote
    0.44
     Escribe
    0.42
     crítico
    0.41
     Critic
    0.41
     crítica
    0.41
     critics
    0.41
     review
    0.40
    Act Density 0.035%

    No Known Activations