INDEX
Explanations
slight unease or disappointment
New Auto-Interp
Negative Logits
absolutely
0.90
mindestens
0.75
⋮
0.69
災害
0.69
invester
0.67
Pivot
0.66
lava
0.65
会导致
0.65
Absolutely
0.65
Bootcamp
0.65
POSITIVE LOGITS
uneas
1.07
uneasy
1.05
expression
0.92
slightly
0.92
语气
0.91
sedikit
0.91
слегка
0.91
slightly
0.90
bitter
0.90
discontent
0.89
Activations Density 0.036%