Negative Logits
爱好者  0.47
Airbnb  0.44
餐饮  0.41
Rental  0.40
ethical  0.38
Mentor  0.38
სპეცი  0.37
ოში  0.37
๊ะ  0.37
Amazing  0.36

Positive Logits
again  0.56
trivially  0.56
Therefore  0.55
Again  0.50
puisque  0.50
->  0.50
weer  0.50
(-  0.48
wiederum  0.48
Substituting  0.47
Activations Density 0.079%