    Explanations

    references to the word "giant" along with other context-specific terms

    mentions of "giant" and its variations in contexts related to scale or size

    Negative Logits
    Token        Logit
    "say"        -0.70
    "nikov"      -0.66
    "nces"       -0.65
    "ggies"      -0.65
    "ople"       -0.63
    "rals"       -0.63
    "ntax"       -0.62
    "akers"      -0.62
    "cause"      -0.61
    "rina"       -0.61
    Positive Logits
    Token        Logit
    " squid"     1.16
    " Squid"     0.89
    " bould"     0.85
    " leap"      0.83
    " Panda"     0.80
    "ess"        0.80
    " ape"       0.78
    " leaps"     0.78
    " strides"   0.77
    " Slayer"    0.76
    Activation Density: 0.063%

    No Known Activations