INDEX
    Explanations

    phrases related to popular movies, TV shows, and characters

    New Auto-Interp
    Negative Logits
    STEM
    -0.70
    ILLE
    -0.66
    ONY
    -0.66
    uliffe
    -0.65
    arette
    -0.64
     Nieto
    -0.63
    nown
    -0.60
    mpeg
    -0.59
    */(
    -0.59
    agents
    -0.59
    POSITIVE LOGITS
     Admir
    0.82
     Rings
    0.77
    warts
    0.69
     Apostles
    0.66
     Thieves
    0.65
    swer
    0.65
     Prayer
    0.64
     Canterbury
    0.62
     Sheep
    0.61
     Scrolls
    0.60
    Act Density 0.042%

    No Known Activations