INDEX
Explanations
old school and old-fashioned
New Auto-Interp
Negative Logits
young
0.48
యువ
0.45
多年
0.44
jungen
0.41
giovani
0.41
joven
0.41
modern
0.40
jovem
0.40
молодых
0.40
输出
0.39
POSITIVE LOGITS
fashioned
1.37
fashioned
1.27
timers
0.88
timers
0.86
adage
0.81
timer
0.76
school
0.71
जमाने
0.64
زمانے
0.61
enburg
0.58
Activations Density 0.030%