INDEX
    Explanations

    terms related to imagination and its various forms

    NEGATIVE LOGITS
    Token       Logit
    "bye"       -0.70
    "andra"     -0.69
    "ldon"      -0.67
    "upon"      -0.67
    " [+"       -0.67
    "paying"    -0.67
    "hill"      -0.66
    "hide"      -0.66
    "×IJ"       -0.66
    "ishops"    -0.65
    POSITIVE LOGITS
    Token             Logit
    " imagination"    1.05
    " imag"           1.00
    " imagin"         0.88
    "issance"         0.88
    " imagining"      0.82
    " Balloon"        0.75
    " Interpret"      0.74
    " imaginative"    0.73
    "ufact"           0.71
    "urable"          0.71
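
For readers unfamiliar with these lists, here is a minimal sketch of how such a token ranking could be produced. It assumes (the page does not say so) that each value is the feature's output direction projected through the model's unembedding matrix. The names `top_logit_tokens`, `direction`, `unembed`, and `vocab` are hypothetical, and the data is a toy example, not this feature's weights.

```python
import numpy as np

def top_logit_tokens(direction, unembed, vocab, k=10):
    """Return the k most negative and k most positive tokens by logit effect.

    Assumption: the logit effect of a feature on each token is
    direction @ unembed, with unembed of shape (d_model, vocab_size).
    """
    effects = direction @ unembed      # shape: (vocab_size,)
    order = np.argsort(effects)        # indices sorted ascending by effect
    negative = [(vocab[i], float(effects[i])) for i in order[:k]]
    positive = [(vocab[i], float(effects[i])) for i in order[::-1][:k]]
    return negative, positive

# Toy example: 2-dimensional model, 3-token vocabulary (illustrative only).
toy_vocab = ["bye", "upon", " imagination"]
toy_unembed = np.array([[-0.70, -0.67, 1.05],
                        [0.00, 0.00, 0.00]])
toy_direction = np.array([1.0, 0.0])

neg, pos = top_logit_tokens(toy_direction, toy_unembed, toy_vocab, k=1)
print(neg, pos)
```

Under this reading, tokens in the positive list are ones the feature pushes the model toward predicting, and tokens in the negative list are ones it pushes away from.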
    Activation Density: 0.040%
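
As a rough illustration of the density figure above, the sketch below assumes it means the fraction of sampled tokens on which the feature's activation is greater than zero. That definition and the helper name `activation_density` are assumptions, and the activations are synthetic.

```python
import numpy as np

def activation_density(activations):
    """Fraction of tokens with a strictly positive activation (assumed definition)."""
    acts = np.asarray(activations)
    return float(np.count_nonzero(acts > 0) / acts.size)

# Synthetic example: the feature fires on 4 of 10,000 tokens.
acts = np.zeros(10_000)
acts[:4] = 1.0
print(f"{activation_density(acts):.3%}")  # prints 0.040%
```

Read this way, a density of 0.040% corresponds to the feature firing on about 4 in every 10,000 tokens.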

    No Known Activations