INDEX
Explanations
quotations in text
punctuation or dialogue attributes in sentences
Negative Logits
Token      Logit
jungle     -0.90
Boko       -0.86
ninja      -0.78
villages   -0.78
Bangkok    -0.75
mosquito   -0.75
¥µ         -0.74
totem      -0.74
BAT        -0.73
village    -0.72
Positive Logits
Token      Logit
Stein      1.64
Schwe      1.29
Ste        1.27
Ste        1.23
Schwar     1.16
Sch        1.14
stein      1.14
Sche       1.14
Wein       1.14
STE        1.11
Activations Density: 0.365%