support Dominion format #119
Comments
@gngilbert @CalebKleppner do you know how we can get this data? |
I believe I have the data from Santa Fe. I will try to get it uploaded tomorrow and we can discuss it. I could not figure it out. We may have to talk to Dominion. |
I think I have attached the Santa Fe CVR files here. Not sure how this works. If they are not attached, I can send them attached to an e-mail. |
Thanks - fascinating. |
That was my reaction as well and why we have not taken on the task of converting it to run in the RCVRC Tabulator. At some point, however, we need to do this and I am looking for recommendations as to how to do this in the most efficient manner. |
Jon, should you bring David in on this issue? (I would but don't know how. Thanks.) |
@davidryal |
We'll keep this on the shelf for now. |
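Pedro at FairVote has been working with the San Francisco CVRs I'm attaching to this post, and developed this process for converting the JSON into a human-readable format: https://docs.google.com/document/d/1uR94xFn-oB3B_17lftP2gZkLZAGtvs6Wu5rw2vryDsE/edit?usp=sharing. Wanted to share in case it's useful as you guys get started on Dominion. CVR_Export_20191125163446.zip |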
More questions from @moldover: do we need to handle multiple contests? |
I'm not familiar enough with any of this to provide answers. Do we need to
bring Keith in on this?
George Gilbert
RCV Resource Center
george.gilbert@rankedchoicevoting.org
336-906-0047
On Mon, Feb 10, 2020 at 1:57 AM Louis Eisenberg wrote:
More questions from @moldover <https://github.com/moldover>:
- Do we need to handle multiple contests?
- How are column headers parsed? There's a bunch of different text:
  Original/Cards/0/Contests/0/Marks/0/Rank
  Original/Cards/0/PaperIndex
  Original/Cards/0/Contests/0/Marks/8/MarkDensity
- Do we need to interpret selections, using the outstack condition manifest?
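
Editorial sketch on the column-header question: headers such as `Original/Cards/0/Contests/0/Marks/0/Rank` look like paths produced by flattening nested JSON, where names are object keys and numbers are list positions. That reading is an assumption, not something confirmed by Dominion. The Python below shows how such headers could arise and how one could split them back into parts; `flatten`, `parse_header`, and the sample record are hypothetical names and data made up for illustration.

```python
def flatten(node, prefix=""):
    """Flatten nested dicts/lists into {'A/0/B': value} pairs."""
    items = {}
    if isinstance(node, dict):
        for key, value in node.items():
            items.update(flatten(value, f"{prefix}{key}/"))
    elif isinstance(node, list):
        for index, value in enumerate(node):
            items.update(flatten(value, f"{prefix}{index}/"))
    else:
        # Leaf value: drop the trailing slash to form the header.
        items[prefix.rstrip("/")] = node
    return items


def parse_header(header):
    """Split a header into path parts; numeric parts become list indexes."""
    return [int(part) if part.isdigit() else part for part in header.split("/")]


# Hypothetical record shaped to match the headers quoted above (assumption).
record = {
    "Original": {
        "Cards": [
            {
                "PaperIndex": 0,
                "Contests": [{"Marks": [{"Rank": 1, "MarkDensity": 97}]}],
            }
        ]
    }
}

flat = flatten(record)
for header, value in flat.items():
    print(header, "=", value)
```

Under this reading, the number after `Cards`, `Contests`, or `Marks` identifies which card, contest, or mark the column belongs to, so handling multiple contests would mean grouping columns by the index that follows `Contests`.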
|
These are the same questions that Jon asked previously over email; I'm just reproducing them here to keep things organized. Presumably Keith has already shared them with Dominion folks. |
Thanks, Louis.
George Gilbert
RCV Resource Center
george.gilbert@rankedchoicevoting.org
336-906-0047
|
Thanks for digging out the data and a parsing method. I'm pretty sure I've parsed similar Dominion JSON files before in Python. As yet another approach, the RLA (SHANGRLA) audit of the election seems to have used the JavaScript in this file to parse out just the votes on each ballot in a given contest: https://github.com/pbstark/SHANGRLA/blob/master/ConvertCVRToRAIRE.html
That yields this output file for Mayor. But I've only glanced at that stuff, so I might have misinterpreted something there. |
All, I see that you've gotten started on working with Dominion JSON files for CVRs. This is good, as I'm sure you've heard that the Dem RCV contests in Alaska, Wyoming and Kansas have recently switched from ClearBallot to Dominion equipment. Yes, we are just 7 weeks away from the elections, and the contractor has changed equipment. This means that the cast vote records will be in the JSON format that San Francisco uses (https://www.sfelections.org/results/20191105/data/20191125/CVR_Export_20191125163446.zip).
I will probably be conducting the RCV tallies for these 3 states and will be relying on the Universal Tabulator. I'm very eager to start testing it with San Francisco data.
Should I plan on obtaining CVRs in Dominion JSON format, converting them to a human-readable format as Pedro worked out below, and then tabulating in the Universal Tabulator?
I believe that I'll be receiving a large set of CVRs that include both mail ballots and precinct ballots, so it will be necessary for me to break the CVRs into separate files based on county (Wyoming) or CD (Kansas).
Thanks for any info you can provide. I'm happy to discuss anytime (203-376-4080, ck@fairvote.org).
Best,
Caleb
Caleb Kleppner
203-376-4080
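
Editorial sketch of the split by county or congressional district described above, in Python. It assumes each CVR has already been converted into a dictionary that carries its jurisdiction; the field name `county`, the helper `split_by_key`, and the sample records are hypothetical and do not come from the Dominion export.

```python
from collections import defaultdict


def split_by_key(records, key):
    """Group records by the value stored under `key`, keeping input order."""
    groups = defaultdict(list)
    for record in records:
        groups[record[key]].append(record)
    return dict(groups)


# Made-up records: "county" and "rankings" are assumed field names.
cvrs = [
    {"county": "Albany", "rankings": ["A", "B"]},
    {"county": "Laramie", "rankings": ["B"]},
    {"county": "Albany", "rankings": ["C", "A"]},
]

by_county = split_by_key(cvrs, "county")
for county, records in by_county.items():
    print(county, len(records))
```

Each group could then be written to its own file and tabulated separately; the same helper would work for a congressional-district field if the records carry one.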
On 1/28/2020 10:21 AM, chughes297
wrote:
Pedro at FairVote has been working with the San Francisco CVRs
I'm attaching to this post, and developed this process for
converting the JSON in to a human readable format:
https://docs.google.com/document/d/1uR94xFn-oB3B_17lftP2gZkLZAGtvs6Wu5rw2vryDsE/edit?usp=sharing.
Wanted to share in case it's useful as you guys get started on
Dominion
CVR_Export_20191125163446.zip
|
All, obviously this is very short notice on this change of vendor. I think
we need to plan a conference call with Caleb for later this week. I'll ask
Chris to send out a Doodle poll.
George Gilbert
RCV Resource Center
george.gilbert@rankedchoicevoting.org
336-906-0047
|
Here's a parser that @catrope wrote for the new format: https://github.com/catrope/sf-rcv/blob/master/parse-new-format.js
And the rest of the code in that repo handles the old format. More explanation from him:

I apologize for the lack of documentation, so I'll briefly explain it here instead.

$ wget https://www.sfelections.org/results/20191105/data/20191114/CVR_Export_20191114160248.zip

The attached screenshot illustrates what the output looks like. It's a big array, where every element is a ballot card, and every race is an array of choices. For RCV contests, the first element is the first choice, the second element the second choice, etc.; for non-RCV contests (including measures), there is only one element. If a choice is null, that means it was left blank (undervote); if the choice is itself an array of multiple values, that means multiple choices were selected (overvote). So in the attached example, the District Attorney rankings were 1) blank, 2) Loftus, 3) overvote for both Tung and Boudin, 4) Dautch, and their Mayor rankings were 1) Breed, 2) Pang, 3) Ventresca, 4) blank, 5) Zhou, 6) Jordan+Robertson overvote.

I haven't yet adapted my RCV code to ingest this format, but it shouldn't be too much work, and the format should be relatively easy to deal with for other scripts as well. Since the data for non-RCV races is also all there, you should also be able to compute correlations between contests that appear on the same card (e.g. local measures: how many people voted Yes on A but No on E or vice versa, and where were they located?).

One thing I want to look at at some point is the geographic distribution of Nancy Tung's transferred votes: Tung->Loftus, Tung->Boudin and Tung->exhausted were each over 30%, and I'm curious to see if those three groups are concentrated anywhere in particular. I also want to look at the second choices of Boudin and Loftus voters. |
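
Editorial sketch of reading the output format described above, in Python. It takes the description at face value: a choice of null (Python `None`) is an undervote, a list is an overvote, and anything else is a single candidate. How each ballot card is keyed is not stated in the thread, so representing a card as a dictionary from race name to rankings is an assumption, and `describe_choice` and `describe_rankings` are hypothetical helper names.

```python
def describe_choice(choice):
    """Label one ranking slot as an undervote, an overvote, or a candidate."""
    if choice is None:
        return "undervote"
    if isinstance(choice, list):
        return "overvote (" + "+".join(choice) + ")"
    return choice


def describe_rankings(rankings):
    """Number each slot starting from the first choice."""
    return [
        f"{rank}) {describe_choice(choice)}"
        for rank, choice in enumerate(rankings, start=1)
    ]


# The example card from the comment; the dict-per-card shape is an assumption.
card = {
    "District Attorney": [None, "Loftus", ["Tung", "Boudin"], "Dautch"],
    "Mayor": ["Breed", "Pang", "Ventresca", None, "Zhou", ["Jordan", "Robertson"]],
}

for race, rankings in card.items():
    print(race + ": " + ", ".join(describe_rankings(rankings)))
```

The sketch only labels each slot. Whether a blank or overvoted ranking is skipped or exhausts the ballot depends on the contest's rules, which this example does not try to model.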
Latest update: I believe we mostly have this issue addressed with the closing of #404, #406, #407, #408, and #415. Remaining related open issues (which could probably supersede the need to keep this one open): #434, #437, #438. @moldover @tarheel, could you please look closely over this issue and file any other necessary issues to address any last loose ends here? Then I think we should be good to close it. |
Sounds right to me. Will let @moldover make the final call. |
Closed via #470 |
George's comment from #96: "I believe the guys at FairVote have the Dominion data from Santa
Fe."