INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     occasionally
    0.44
     momentarily
    0.44
     forever
    0.43
     perhaps
    0.42
     wondered
    0.38
     passagem
    0.37
    бычно
    0.37
     carefully
    0.36
     implicitly
    0.36
     Occasionally
    0.36
    POSITIVE LOGITS
    Answers
    0.53
    需求
    0.49
     Answers
    0.49
    上記の
    0.49
     answers
    0.48
    answers
    0.47
     réponses
    0.47
     उपरोक्त
    0.47
     respostas
    0.46
    0.45
    Act Density 0.003%

    No Known Activations