INDEX
    Explanations

    phrases related to logical reasoning or logic

    references to logic and reasoning in various contexts

    New Auto-Interp
    Negative Logits
    avez
    -0.83
     Volunte
    -0.73
    hold
    -0.67
    Shar
    -0.65
    emale
    -0.65
    ometown
    -0.63
     Leopard
    -0.62
    semble
    -0.62
    atern
    -0.61
     national
    -0.61
    POSITIVE LOGITS
     logic
    0.98
     underpin
    0.88
    matical
    0.84
    ical
    0.81
    SourceFile
    0.81
     reasoning
    0.81
    istically
    0.80
    matic
    0.77
    ophical
    0.77
     dictates
    0.75
    Act Density 0.024%

    No Known Activations