INDEX
    Explanations

    references to particular food items

    New Auto-Interp
    Negative Logits
     Paglinawan
    -0.64
     wineries
    -0.61
    verwijspagina
    -0.61
    مصادر
    -0.61
     BoxFit
    -0.61
     wine
    -0.60
     sherry
    -0.60
    yntaxException
    -0.60
     linens
    -0.59
    +:+
    -0.59
    POSITIVE LOGITS
     McDonald
    0.86
     burger
    0.85
     McDonalds
    0.79
    McDonald
    0.78
     burgers
    0.77
     franchise
    0.75
     fast
    0.74
    InjectAttribute
    0.74
     fries
    0.74
    🍔
    0.74
    Act Density 0.116%

    No Known Activations