INDEX
Explanations
phrases related to legal issues or courtroom discussions
Negative Logits
change        -1.20
change        -0.97
switch        -0.93
CHANGE        -0.90
cambio        -0.89
Change        -0.89
shift         -0.86
changement    -0.83
changed       -0.83
changer       -0.79
Positive Logits
متعلقه        0.83
صوتيه         0.75
참고            0.73
الاطلاع       0.72
[toxicity=0]  0.72
findpost      0.71
出版年           0.71
+#+#          0.70
שוליים        0.70
Xna           0.68
Activations Density: 0.012%