INDEX
    Explanations

    pretty followed by adjective

    New Auto-Interp
    Negative Logits
     AND
    0.64
     and
    0.59
    ll
    0.59
     STATEMENT
    0.57
     Phillies
    0.56
     Provide
    0.55
     WITH
    0.54
    0.52
     furthering
    0.52
     TARGET
    0.52
    POSITIVE LOGITS
     khá
    0.65
    不錯
    0.65
     довольно
    0.64
     decently
    0.61
     непло
    0.61
     reasonably
    0.60
     достаточно
    0.60
     ziemlich
    0.60
     dość
    0.59
     bastante
    0.57
    Act Density 0.061%

    No Known Activations