Re: DEP 5 and directory/file names with spaces
On Tue, 2009-06-09 at 00:47 -0700, Steve Langasek wrote:
> Sorry, this is an incorrect test; on closer examination, the upstream
> tarballs aren't unpacked in the lintian lab. The actual count (from a
> recursive tar ztvf) is
>
> commas: 1032
> spaces: 7305
> total files: 4180690
>
> By number of affected packages, this is:
>
> commas: 57
> spaces: 310
For further amusement (I want my shed painted pink, with polkadots in
lime-green), attached please find a) a script to compute frequencies of
bytes in filenames in all lenny main source packages, and b) the output
of the script.
I unpacked the source packages using the unpack-debian-sources script in
http://code.liw.fi/extrautils/bzr/trunk/ (bzr branch); I apologize for
not having made a release that includes the script yet.
Some highlights:
* two carriage return chars (\r)
* one escape char
* 5431 spaces
* 1 double quote (")
* 98 single quotes (')
* 64 asterisks (*)
* 524 commas
* 3 backslashes
* 51601 percent chars (%)
> > This is bikeshedding in the extreme.
>
> But I think this still stands.
I changed my mind. I want black with skulls and crossbones in silver,
plus bunnies in gold.
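#!/usr/bin/python
# Python 2 script: count how often each byte occurs in file basenames
# under the directories given on the command line.
import os
import sys

counts = {}
for top in sys.argv[1:]:
    for dirname, dirnames, basenames in os.walk(top):
        for x in basenames:
            # In Python 2 a filename is a byte string, so c is one byte.
            for c in x:
                counts[c] = counts.get(c, 0) + 1

# One line per byte: decimal value, repr, number of occurrences.
for c in sorted(counts.keys()):
    print "%3d %s %d" % (ord(c), repr(c), counts[c])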
13 '\r' 2
27 '\x1b' 1
32 ' ' 5431
33 '!' 81
34 '"' 1
35 '#' 342
36 '$' 1049
37 '%' 51601
38 '&' 166
39 "'" 98
40 '(' 348
41 ')' 330
42 '*' 64
43 '+' 7143
44 ',' 524
45 '-' 834270
46 '.' 3237353
48 '0' 535175
49 '1' 516998
50 '2' 478215
51 '3' 229333
52 '4' 317429
53 '5' 163675
54 '6' 214617
55 '7' 124864
56 '8' 156637
57 '9' 87415
58 ':' 883
59 ';' 9
60 '<' 57
61 '=' 597
62 '>' 66
63 '?' 27
64 '@' 1748
65 'A' 164851
66 'B' 79193
67 'C' 201391
68 'D' 156687
69 'E' 204747
70 'F' 91842
71 'G' 78624
72 'H' 55258
73 'I' 149702
74 'J' 18033
75 'K' 32199
76 'L' 131432
77 'M' 280240
78 'N' 99264
79 'O' 90816
80 'P' 141881
81 'Q' 16678
82 'R' 159067
83 'S' 230612
84 'T' 176242
85 'U' 57109
86 'V' 36488
87 'W' 38912
88 'X' 32973
89 'Y' 16593
90 'Z' 5389
91 '[' 111
92 '\\' 3
93 ']' 110
94 '^' 157
95 '_' 1178360
96 '`' 81
97 'a' 2554343
98 'b' 671246
99 'c' 2171257
100 'd' 1236447
101 'e' 3577704
102 'f' 875451
103 'g' 1287462
104 'h' 1269265
105 'i' 2388345
106 'j' 183252
107 'k' 579332
108 'l' 2151027
109 'm' 1606322
110 'n' 2218732
111 'o' 2109188
112 'p' 2131207
113 'q' 103211
114 'r' 2040057
115 's' 2334579
116 't' 2933157
117 'u' 880994
118 'v' 484188
119 'w' 315390
120 'x' 663838
121 'y' 444930
122 'z' 116657
123 '{' 1
124 '|' 2
125 '}' 1
126 '~' 2272
128 '\x80' 23
129 '\x81' 5
130 '\x82' 6
131 '\x83' 10
132 '\x84' 26
133 '\x85' 2
134 '\x86' 7
136 '\x88' 4
138 '\x8a' 7
139 '\x8b' 12
140 '\x8c' 17
141 '\x8d' 1
143 '\x8f' 4
144 '\x90' 2
145 '\x91' 3
147 '\x93' 5
148 '\x94' 20
149 '\x95' 11
150 '\x96' 1
151 '\x97' 6
152 '\x98' 4
153 '\x99' 8
154 '\x9a' 3
155 '\x9b' 1
156 '\x9c' 22
157 '\x9d' 18
158 '\x9e' 2
159 '\x9f' 5
160 '\xa0' 20
161 '\xa1' 9
162 '\xa2' 1
163 '\xa3' 1
164 '\xa4' 15
165 '\xa5' 4
166 '\xa6' 1
167 '\xa7' 9
168 '\xa8' 16
169 '\xa9' 11
170 '\xaa' 3
171 '\xab' 1
172 '\xac' 11
173 '\xad' 4
174 '\xae' 3
175 '\xaf' 1
176 '\xb0' 12
177 '\xb1' 2
178 '\xb2' 13
179 '\xb3' 11
180 '\xb4' 24
181 '\xb5' 5
182 '\xb6' 8
183 '\xb7' 5
184 '\xb8' 9
185 '\xb9' 6
186 '\xba' 2
187 '\xbb' 3
188 '\xbc' 12
189 '\xbd' 10
191 '\xbf' 1
195 '\xc3' 44
197 '\xc5' 2
204 '\xcc' 9
212 '\xd4' 3
221 '\xdd' 3
223 '\xdf' 2
226 '\xe2' 17
227 '\xe3' 4
228 '\xe4' 9
229 '\xe5' 1
232 '\xe8' 1
233 '\xe9' 13
234 '\xea' 24
235 '\xeb' 60
236 '\xec' 91
237 '\xed' 22
238 '\xee' 1
239 '\xef' 5
241 '\xf1' 1
244 '\xf4' 3
246 '\xf6' 5
248 '\xf8' 1
252 '\xfc' 1
254 '\xfe' 3
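
For anyone wanting to repeat the count with Python 3: the script above
uses the Python 2 print statement and relies on Python 2 strings being
bytes, so it will not run unchanged. A rough Python 3 equivalent might
look like the sketch below. The function name count_filename_bytes is
made up for illustration. It passes a bytes path to os.walk so that
filenames come back as raw bytes rather than decoded text. The output
will differ slightly from the listing above, since the repr of a bytes
object carries a b prefix.

```python
#!/usr/bin/python3
import os
import sys
from collections import Counter


def count_filename_bytes(tops):
    """Count how often each byte value occurs in file basenames under tops.

    Hypothetical helper, not part of the original script.
    """
    counts = Counter()
    for top in tops:
        # A bytes path makes os.walk yield bytes names, so filenames
        # that are not valid UTF-8 are still counted byte by byte.
        for dirname, dirnames, basenames in os.walk(os.fsencode(top)):
            for name in basenames:
                # Iterating a bytes object yields integers 0..255.
                counts.update(name)
    return counts


if __name__ == "__main__":
    counts = count_filename_bytes(sys.argv[1:])
    for byte in sorted(counts):
        print("%3d %r %d" % (byte, bytes([byte]), counts[byte]))
```

As with the original, only basenames of files are counted; directory
names are walked through but not tallied.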