INDEX
    Explanations

    people's names, their roles

    New Auto-Interp
    Negative Logits
     beeindruck
    0.92
     damal
    0.91
     überras
    0.89
     ತನ್ನ
    0.86
     знал
    0.86
     نفسه
    0.85
     bukanlah
    0.84
     himself
    0.83
     giành
    0.83
     admirers
    0.83
    POSITIVE LOGITS
     настройки
    0.76
     your
    0.72
     of
    0.68
     are
    0.66
     or
    0.66
     যারা
    0.65
     настрой
    0.63
    Configure
    0.61
    0.61
     outside
    0.60
    Act Density 0.014%

    No Known Activations