INDEX
    Explanations

    phrases related to memorable dialogue or quotes from movies

    Negative Logits
    Token         Logit
    " deflate"    -0.17
    "_tile"       -0.16
    "ä»Ļ"         -0.15
    "662"         -0.15
    "erdale"      -0.15
    "ông"         -0.14
    "tile"        -0.14
    "urette"      -0.14
    "ocked"       -0.14
    "еÑĢÑĤи"      -0.13
    Positive Logits
    Token             Logit
    " Termin"         0.35
    " TERMIN"         0.35
    " Terminator"     0.33
    " termin"         0.32
    " terminator"     0.32
    " Schwar"         0.30
    " Judgment"       0.29
    " Arnold"         0.29
    "termin"          0.28
    " Sarah"          0.27
    Activation Density: 0.005%

    No Known Activations