INDEX
Explanations
the word "Reason" and related forms
instances and variations of the word "reason"
New Auto-Interp
Negative Logits
avorite
-0.82
ibaba
-0.79
omez
-0.72
Carbuncle
-0.72
yss
-0.68
wana
-0.67
annis
-0.65
ModLoader
-0.63
Watt
-0.62
Wink
-0.62
POSITIVE LOGITS
abl
1.16
why
0.94
ably
0.88
boards
0.78
neum
0.77
Ľ
0.74
orial
0.73
¿½
0.73
WHY
0.71
atic
0.71
Activations Density 0.025%