INDEX
    Explanations

    mentions of the word "Win" followed by numbers that could potentially represent versions or models

    references to "Win" in various contexts

    New Auto-Interp
    Negative Logits
    âĶģ
    -0.77
    ADRA
    -0.75
    £ı
    -0.71
     condu
    -0.70
     trave
    -0.70
    ĸļ
    -0.68
     destro
    -0.68
     gravity
    -0.68
    ¿½
    -0.67
    ħĭ
    -0.67
    POSITIVE LOGITS
    ners
    1.16
    frey
    1.06
    nings
    0.99
    throp
    0.95
    fred
    0.94
    win
    0.94
    ning
    0.91
    NER
    0.86
    now
    0.83
    ces
    0.82
    Act Density 0.012%

    No Known Activations