INDEX
    Explanations

    phrases related to providing guidance, instructions, or encouragement to others

    New Auto-Interp
    Negative Logits
     Firstly
    -0.71
     nutshell
    -0.69
    anny
    -0.66
    velop
    -0.62
    entin
    -0.61
     Frie
    -0.61
     Owl
    -0.61
     whichever
    -0.58
     Guarant
    -0.58
     Âł Âł Âł Âł Âł Âł Âł Âł
    -0.58
    POSITIVE LOGITS
     similarly
    1.50
     similar
    1.35
     likewise
    1.18
     equally
    1.13
     same
    0.93
    similar
    0.91
    same
    0.90
    Similar
    0.89
     emulate
    0.87
     comparable
    0.85
    Act Density 0.575%

    No Known Activations