<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">JDS</journal-id>
<journal-title-group><journal-title>Journal of Data Science</journal-title></journal-title-group>
<issn pub-type="epub">1683-8602</issn><issn pub-type="ppub">1680-743X</issn><issn-l>1680-743X</issn-l>
<publisher>
<publisher-name>School of Statistics, Renmin University of China</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">JDS1059</article-id>
<article-id pub-id-type="doi">10.6339/22-JDS1059</article-id>
<article-categories><subj-group subj-group-type="heading">
<subject>Computing in Data Science</subject></subj-group></article-categories>
<title-group>
<article-title>The Python Package open-crypto: A Cryptocurrency Data Collector</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Günther</surname><given-names>Steffen</given-names></name><xref ref-type="aff" rid="j_jds1059_aff_001">1</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Fieberg</surname><given-names>Christian</given-names></name><email xlink:href="mailto:cfieberg@uni-bremen.de">cfieberg@uni-bremen.de</email><xref ref-type="aff" rid="j_jds1059_aff_002">2</xref><xref ref-type="corresp" rid="cor1">∗</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Poddig</surname><given-names>Thorsten</given-names></name><xref ref-type="aff" rid="j_jds1059_aff_001">1</xref>
</contrib>
<aff id="j_jds1059_aff_001"><label>1</label>Chair of Finance, <institution>University of Bremen</institution>, <country>Germany</country></aff>
<aff id="j_jds1059_aff_002"><label>2</label>Empirical Capital Market Research and Derivatives, <institution>University of Bremen</institution>, <country>Germany</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>∗</label>Corresponding author. Email: <ext-link ext-link-type="uri" xlink:href="mailto:cfieberg@uni-bremen.de">cfieberg@uni-bremen.de</ext-link>.</corresp>
</author-notes>
<pub-date pub-type="ppub"><year>2023</year></pub-date><pub-date pub-type="epub"><day>10</day><month>8</month><year>2022</year></pub-date><volume>21</volume><issue>1</issue><fpage>127</fpage><lpage>144</lpage><supplementary-material id="S1" content-type="document" xlink:href="jds1059_s001.pdf" mimetype="application" mime-subtype="pdf">
<caption>
<title>Supplementary Material</title>
<p>We provide several additional information in the supplementary materials, regarding (1) further information on the installation process, (2) troubleshooting list, (3) requesting real-time data, (4) exchanges and endpoints, (5) cross-software usability from <sans-serif>R</sans-serif> and <sans-serif>Matlab</sans-serif> and (6) connectivity to server-based database management systems.</p>
</caption>
</supplementary-material><history><date date-type="received"><day>14</day><month>4</month><year>2022</year></date><date date-type="accepted"><day>3</day><month>7</month><year>2022</year></date></history>
<permissions><copyright-statement>2023 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.</copyright-statement><copyright-year>2023</copyright-year>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>This paper introduces the package <italic>open-crypto</italic> for free-of-charge and systematic cryptocurrency data collecting. The package supports several methods to request (1) static data, (2) real-time data and (3) historical data. It allows to retrieve data from over 100 of the most popular and liquid exchanges world-wide. New exchanges can easily be added with the help of provided templates or updated with build-in functions from the project repository. The package is available on GitHub and the <sans-serif>Python</sans-serif> package index (PyPi). The data is stored in a relational <sans-serif>SQL</sans-serif> database and therefore accessible from many different programming languages. We provide a hands-on and illustrations for each data type, explanations on the received data and also demonstrate the usability from <sans-serif>R</sans-serif> and <sans-serif>Matlab</sans-serif>. Academic research heavily relies on costly or confidential data, however, open data projects are becoming increasingly important. This project is mainly motivated to contribute to openly accessible software and free data in the cryptocurrency markets to improve transparency and reproducibility in research and any other disciplines.</p>
</abstract>
<kwd-group>
<label>Keywords</label>
<kwd>cryptocurrency</kwd>
<kwd>open data</kwd>
<kwd>Python</kwd>
</kwd-group>
</article-meta>
</front>
<back>
<ref-list id="j_jds1059_reflist_001">
<title>References</title>
<ref id="j_jds1059_ref_001">
<mixed-citation publication-type="other"> <string-name><surname>Aloosh</surname> <given-names>A</given-names></string-name>, <string-name><surname>Li</surname> <given-names>J</given-names></string-name> (2019). Direct evidence of bitcoin wash trading. Working paper.</mixed-citation>
</ref>
<ref id="j_jds1059_ref_002">
<mixed-citation publication-type="chapter"> <string-name><surname>Bayer</surname> <given-names>M</given-names></string-name> (<year>2012</year>). <chapter-title>Sqlalchemy</chapter-title>. In: <source><italic>The Architecture of Open Source Applications Volume II: Structure, Scale, and a Few More Fearless Hacks</italic></source> (<string-name><given-names>A</given-names> <surname>Brown</surname></string-name>, <string-name><given-names>G</given-names> <surname>Wilson</surname></string-name>, eds.). <comment>aosabook.org</comment>.</mixed-citation>
</ref>
<ref id="j_jds1059_ref_003">
<mixed-citation publication-type="other"> <string-name><surname>Cong</surname> <given-names>L</given-names></string-name>, <string-name><surname>Li</surname> <given-names>X</given-names></string-name>, <string-name><surname>Tang</surname> <given-names>K</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>Y</given-names></string-name> (2019). Crypto wash trading. Working paper.</mixed-citation>
</ref>
<ref id="j_jds1059_ref_004">
<mixed-citation publication-type="journal"> <string-name><surname>Gewin</surname> <given-names>V</given-names></string-name> (<year>2016</year>). <article-title>Data sharing: An open mind on open data</article-title>. <source><italic>Nature</italic></source>, <volume>529</volume>(<issue>7584</issue>): <fpage>117</fpage>–<lpage>119</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1059_ref_005">
<mixed-citation publication-type="other"> <string-name><surname>Hougan</surname> <given-names>M</given-names></string-name>, <string-name><surname>Kim</surname> <given-names>H</given-names></string-name>, <string-name><surname>Lerner</surname> <given-names>M</given-names></string-name> (2019). Economic and non-economic trading in bitcoin: Exploring the real spot market for the world’s first digital commodity. Working Paper.</mixed-citation>
</ref>
<ref id="j_jds1059_ref_006">
<mixed-citation publication-type="journal"> <string-name><surname>Iacus</surname> <given-names>SM</given-names></string-name> (<year>2015</year>). <article-title>Automated data collection with R – A practical guide to web scraping and text mining</article-title>. <source><italic>Journal of Statistical Software, Book Reviews</italic></source>, <volume>68</volume>(<issue>3</issue>): <fpage>1</fpage>–<lpage>3</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1059_ref_007">
<mixed-citation publication-type="book"> <string-name><surname>Kim</surname> <given-names>N</given-names></string-name>, <string-name><surname>Svetlov</surname> <given-names>A</given-names></string-name> (<year>2020</year>). <source><italic>Aiohttp. Release 4.0.0a1</italic></source>.</mixed-citation>
</ref>
<ref id="j_jds1059_ref_008">
<mixed-citation publication-type="journal"> <string-name><surname>Makarov</surname> <given-names>I</given-names></string-name>, <string-name><surname>Schoar</surname> <given-names>A</given-names></string-name> (<year>2020</year>). <article-title>Trading and arbitrage in cryptocurrency markets</article-title>. <source><italic>Journal of Financial Economics</italic></source>, <volume>135</volume>(<issue>2</issue>): <fpage>293</fpage>–<lpage>319</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1059_ref_009">
<mixed-citation publication-type="other"> <string-name><surname>Nakamoto</surname> <given-names>S</given-names></string-name> (2008). Bitcoin: A peer-to-peer electronic cash system.</mixed-citation>
</ref>
<ref id="j_jds1059_ref_010">
<mixed-citation publication-type="journal"> <string-name><surname>Pennec</surname> <given-names>GL</given-names></string-name>, <string-name><surname>Fiedler</surname> <given-names>I</given-names></string-name>, <string-name><surname>Ante</surname> <given-names>L</given-names></string-name> (<year>2021</year>). <article-title>Wash trading at cryptocurrency exchanges</article-title>. <source><italic>Finance Research Letters</italic></source>, <volume>43</volume>: <fpage>101982</fpage>.</mixed-citation>
</ref>
<ref id="j_jds1059_ref_011">
<mixed-citation publication-type="book"> <collab>R Core Team</collab> (<year>2020</year>). <source><italic>R: A Language and Environment for Statistical Computing</italic></source>. <publisher-name>R Foundation for Statistical Computing</publisher-name>, <publisher-loc>Vienna, Austria</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1059_ref_012">
<mixed-citation publication-type="journal"> <string-name><surname>Reichman</surname> <given-names>OJ</given-names></string-name>, <string-name><surname>Jones</surname> <given-names>MB</given-names></string-name>, <string-name><surname>Schildhauer</surname> <given-names>MP</given-names></string-name> (<year>2011</year>). <article-title>Challenges and opportunities of open data in ecology</article-title>. <source><italic>Science</italic></source>, <volume>331</volume>(<issue>6018</issue>): <fpage>703</fpage>–<lpage>705</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1059_ref_013">
<mixed-citation publication-type="journal"> <string-name><surname>Szöcs</surname> <given-names>E</given-names></string-name>, <string-name><surname>Stirling</surname> <given-names>T</given-names></string-name>, <string-name><surname>Scott</surname> <given-names>ER</given-names></string-name>, <string-name><surname>Scharmüller</surname> <given-names>A</given-names></string-name>, <string-name><surname>Schäfer</surname> <given-names>RB</given-names></string-name> (<year>2020</year>). <article-title>webchem: An R package to retrieve chemical information from the web</article-title>. <source><italic>Journal of Statistical Software</italic></source>, <volume>93</volume>(<issue>13</issue>): <fpage>1</fpage>–<lpage>17</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1059_ref_014">
<mixed-citation publication-type="other"> <string-name><surname>The pandas development team</surname></string-name> (2020). pandas-dev/pandas: Pandas.</mixed-citation>
</ref>
<ref id="j_jds1059_ref_015">
<mixed-citation publication-type="other"> <string-name><surname>van Rossum</surname> <given-names>G</given-names></string-name> (2012). Asyncio: Asynchronous IO support rebooted: The “asyncio” module.</mixed-citation>
</ref>
<ref id="j_jds1059_ref_016">
<mixed-citation publication-type="book"> <string-name><surname>van Rossum</surname> <given-names>G</given-names></string-name>, <string-name><surname>Drake</surname> <given-names>FL</given-names></string-name> (<year>2009</year>). <source><italic>Python 3 Reference Manual</italic></source>. <publisher-name>CreateSpace</publisher-name>, <publisher-loc>Scotts Valley, CA</publisher-loc>.</mixed-citation>
</ref>
</ref-list>
</back>
</article>
