INDEX
    Explanations

    verifier, discriminator, generator

    New Auto-Interp
    Negative Logits
     romant
    0.46
    innamon
    0.43
     joie
    0.43
     Zusammenhang
    0.42
     creations
    0.41
     abband
    0.41
     romantic
    0.40
     plaît
    0.40
     replenished
    0.40
    創造
    0.39
    POSITIVE LOGITS
     inspections
    1.24
     inspection
    1.21
    検査
    1.21
     assessing
    1.15
     проверки
    1.14
    检测
    1.13
     Inspection
    1.13
    ตรวจ
    1.13
     inspecting
    1.12
    1.10
    Act Density 0.265%

    No Known Activations