INDEX
    Explanations

    references to bars and related establishments

    New Auto-Interp
    Negative Logits
    '")
    -0.69
    __))
    -0.68
    -0.67
    Aktualisiert
    -0.66
    -0.66
    ')):
    -0.62
    })`
    -0.62
    ]');
    -0.61
    --}}
    -0.61
    )]$
    -0.61
    POSITIVE LOGITS
    BAR
    1.21
     bar
    1.14
     BAR
    1.12
    bar
    1.12
    Bar
    1.07
     Bar
    1.06
    ibar
    1.02
     bars
    1.01
    Bars
    1.00
    IBar
    0.98
    Act Density 0.154%

    No Known Activations