{"id":197,"date":"2018-12-13T19:49:50","date_gmt":"2018-12-13T19:49:50","guid":{"rendered":"http:\/\/ma-graph.org\/?page_id=2"},"modified":"2026-05-13T16:11:22","modified_gmt":"2026-05-13T15:11:22","slug":"overview","status":"publish","type":"page","link":"https:\/\/semrepo.org\/","title":{"rendered":"Home"},"content":{"rendered":"\n<p class=\"has-medium-font-size\"><a href=\"https:\/\/semrepo.org\/\"><strong>SemRepo<\/strong><\/a>&nbsp;is a large-scale RDF knowledge graph of GitHub repositories linked to scientific research. SemRepo captures fine-grained repository-level metadata (e.g., <em>contributors, issues, programming languages<\/em>) and interlinks this with external scholarly knowledge graphs: repositories to publications in&nbsp;<a href=\"https:\/\/linkedpaperswithcode.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">LPWC<\/a>, repository authors to their profiles in&nbsp;<a href=\"https:\/\/semopenalex.org\/\" target=\"_blank\" rel=\"noreferrer noopener\">SemOpenAlex<\/a>, and research artifacts (e.g., <em>datasets, experiments<\/em>) are linked via&nbsp;<a href=\"https:\/\/dtai-kg.github.io\/MLSea-KGC\/\" target=\"_blank\" rel=\"noreferrer noopener\">MLSea<\/a>.<\/p>\n\n\n\n<p class=\"has-medium-font-size\"><h4><strong>What exactly do we provide?<\/strong><\/h4><\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li class=\"has-medium-font-size\">Periodically updated (approx. twice per year) <a href=\"https:\/\/semrepo.org\/index.php\/data-and-code\/\" data-type=\"page\" data-id=\"25\">RDF dump files<\/a> of the SemRepo Knowledge Graph.<\/li>\n\n\n\n<li class=\"has-medium-font-size\">A publicly accessible <a href=\"https:\/\/semrepo.org\/index.php\/sparql-endpoint\/\" data-type=\"page\" data-id=\"23\" target=\"_blank\" rel=\"noreferrer noopener\">SPARQL endpoint<\/a> containing the latest SemRepo Knowledge Graph data.<\/li>\n\n\n\n<li>An <a href=\"https:\/\/github.com\/faerber-lab\/SemRepo\/\" target=\"_blank\" rel=\"noreferrer noopener\">open-source pipeline<\/a> for SemRepo construction and automatic interlinking.<\/li>\n\n\n\n<li>An <a href=\"https:\/\/semrepo.org\/index.php\/schema-linked-dataset-descriptions\/\" data-type=\"page\" data-id=\"198\" target=\"_blank\" rel=\"noreferrer noopener\">OWL ontology<\/a> with 19 classes and 47 relations for modelling research-connected software repositories.<\/li>\n\n\n\n<li><a href=\"https:\/\/semrepo.org\/index.php\/ontology\/\" data-type=\"page\" data-id=\"198\">VoID and DCAT<\/a> metadata descriptions for dataset discovery, access, and interoperability.<\/li>\n\n\n\n<li>URI resolution of the SemRepo Knowledge Graph within the Linked Open Data Cloud.<\/li>\n\n\n\n<li>Semantic interlinking with external scholarly knowledge graphs.<\/li>\n<\/ol>\n\n\n\n<p class=\"has-medium-font-size\"><h4><strong>How big is the SemRepo Knowledge Graph?<\/strong><\/h4><\/p>\n\n\n\n<p class=\"has-medium-font-size\">SemRepo.org contains (as of April 2026)*:<\/p>\n\n\n\n<p class=\"has-medium-font-size\">\ud83d\uddc3\ufe0f <strong>Repositories<\/strong>: 197,566<\/p>\n\n\n\n<p class=\"has-medium-font-size\">\ud83e\uddd1\u200d\ud83d\udcbb <strong>Contributors<\/strong>: 2,916,508<\/p>\n\n\n\n<p class=\"has-medium-font-size\">\ud83c\udff7\ufe0f <strong>Issues<\/strong>: 2,609,510<\/p>\n\n\n\n<p>\ud83e\udde0 <strong>Programming Language<\/strong>: 387,284<\/p>\n\n\n\n<p class=\"has-medium-font-size\">\ud83c\udfe2 <strong>Organizations<\/strong>: 12,879<\/p>\n\n\n\n<p class=\"has-medium-font-size\">\ud83e\udde0 <strong>Packages<\/strong>: 95,505<\/p>\n\n\n\n<p class=\"has-medium-font-size\">\ud83e\uddf5 <strong>Research Topics<\/strong>: 272,378<\/p>\n\n\n\n<p class=\"has-medium-font-size\">\ud83e\uddd1\u200d\ud83d\udd2c <strong>Linked SemOpenAlex Authors<\/strong>: 11,867 (see <a href=\"https:\/\/semopenalex.org\/\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/semopenalex.org<\/a>)<\/p>\n\n\n\n<p class=\"has-medium-font-size\">\ud83d\udd17 <strong>Linked LPWC Repositories<\/strong>: 197,566 (see <a href=\"https:\/\/linkedpaperswithcode.com\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/linkedpaperswithcode.com<\/a>)<\/p>\n\n\n\n<p>\ud83d\udd17 <strong>MLSea Software entities<\/strong>: 148,185 (see <a href=\"https:\/\/w3id.org\/mlsea\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/w3id.org\/mlsea<\/a>)<\/p>\n\n\n\n<p class=\"has-medium-font-size\">*core classes only, in total SemRepo contains 19 classes.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>SemRepo&nbsp;is a large-scale RDF knowledge graph of GitHub repositories linked to scientific research. SemRepo captures fine-grained repository-level metadata (e.g., contributors, issues, programming languages) and interlinks this with external scholarly knowledge graphs: repositories to publications in&nbsp;LPWC, repository authors to their profiles in&nbsp;SemOpenAlex, and research artifacts (e.g., datasets, experiments) are linked via&nbsp;MLSea. What exactly do we provide? [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"open","template":"","meta":{"footnotes":""},"class_list":["post-197","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/semrepo.org\/index.php\/wp-json\/wp\/v2\/pages\/197","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/semrepo.org\/index.php\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/semrepo.org\/index.php\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/semrepo.org\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/semrepo.org\/index.php\/wp-json\/wp\/v2\/comments?post=197"}],"version-history":[{"count":30,"href":"https:\/\/semrepo.org\/index.php\/wp-json\/wp\/v2\/pages\/197\/revisions"}],"predecessor-version":[{"id":578,"href":"https:\/\/semrepo.org\/index.php\/wp-json\/wp\/v2\/pages\/197\/revisions\/578"}],"wp:attachment":[{"href":"https:\/\/semrepo.org\/index.php\/wp-json\/wp\/v2\/media?parent=197"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}