INDEX
    Explanations

    references to specific television shows or episodes

    New Auto-Interp
    Negative Logits
    é¨İ
    -0.18
    ander
    -0.15
    æ²¢
    -0.15
     hyp
    -0.14
    surf
    -0.14
    adil
    -0.14
    FontAwesome
    -0.14
    avel
    -0.14
    _IMPLEMENT
    -0.14
    essel
    -0.14
    POSITIVE LOGITS
     Walter
    0.30
    FRING
    0.26
     Olivia
    0.26
     Peter
    0.25
     Cortex
    0.24
     Wal
    0.24
     Bishop
    0.24
    Observers
    0.24
     Observer
    0.24
     Astr
    0.24
    Act Density 0.004%

    No Known Activations