INDEX
    Explanations

    restaurant or food-related terms marked as 'signature.'

    New Auto-Interp
    Negative Logits
    -0.83
    <bos>
    -0.74
    /***
    
    -0.67
    /*
    -0.63
     conquête
    -0.62
    ///**
    -0.60
    /**
    -0.59
     avoid
    -0.56
     encourage
    -0.55
    //
    -0.52
    POSITIVE LOGITS
     signature
    2.82
     signatures
    2.55
     Signature
    2.54
    signature
    2.51
     Signatures
    2.38
    Signature
    2.29
     SIGNATURE
    2.28
    signatures
    2.06
    SIGNATURE
    1.87
    Signatures
    1.83
    Act Density 0.147%

    No Known Activations