data-harvesting


2.0.0 data_harvesting-2.0.0-py3-none-any.whl

Wheel Details

Project: data-harvesting
Version: 2.0.0
Filename: data_harvesting-2.0.0-py3-none-any.whl
Size: 671730 bytes
MD5: ce223d7730c65f41592402cfbeb38619
SHA256: a34c2bf99c5ee66bbbccc99016a8a8030f63e0915916f0c1f00cd44062f0a821
Uploaded: 2024-07-09 10:09:30 +0000
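
The size and SHA256 values listed above can be used to check a locally downloaded copy of the wheel. The following is a minimal sketch using only the Python standard library; the helper names `sha256_of_file` and `matches_listing` are hypothetical and not part of the package.

```python
import hashlib
from pathlib import Path

# Values copied from the "Wheel Details" listing above.
EXPECTED_SHA256 = "a34c2bf99c5ee66bbbccc99016a8a8030f63e0915916f0c1f00cd44062f0a821"
EXPECTED_SIZE = 671730  # bytes


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA-256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path):
    """True if the file has the size and SHA-256 digest listed above."""
    candidate = Path(path)
    return (
        candidate.stat().st_size == EXPECTED_SIZE
        and sha256_of_file(candidate) == EXPECTED_SHA256
    )
```

Calling `matches_listing("data_harvesting-2.0.0-py3-none-any.whl")` on a downloaded file would then report whether it corresponds to the upload described here. SHA256 is the better choice for this check; the MD5 value is listed for completeness only.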

dist-info

METADATA

Metadata-Version: 2.1
Name: data-harvesting
Version: 2.0.0
Summary: Set of tools to harvest, process and uplift (meta)data from metadata providers within the Helmholtz association to be included in the Helmholtz Knowledge Graph (Helmholtz-KG).
Author: Jens Bröder
Author-Email: j.broeder[at]fz-juelich.de
Maintainer: Jens Bröder
Maintainer-Email: j.broeder[at]fz-juelich.de
Home-Page: https://codebase.helmholtz.cloud/hmc/hmc-public/unhide/data_harvesting
Project-Url: Repository, https://codebase.helmholtz.cloud/hmc/hmc-public/unhide/data_harvesting.git
License: MIT
Keywords: unhide,Helmholtz association,data mining,HMC,metadata,data publications,software publication,RSE,FAIR,linked data,knowledge graph,json-ld,schema.org,restruct
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Information Technology
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: MIT License
Classifier: Natural Language :: English
Classifier: Operating System :: MacOS :: MacOS X
Classifier: Operating System :: POSIX :: Linux
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Database
Classifier: Topic :: Education
Classifier: Topic :: Scientific/Engineering
Classifier: Topic :: Scientific/Engineering :: Information Analysis
Classifier: Topic :: Scientific/Engineering :: Visualization
Classifier: Topic :: Text Processing
Classifier: Typing :: Typed
Requires-Python: >=3.9,<4.0
Requires-Dist: SPARQLWrapper (<3.0.0,>=2.0.0)
Requires-Dist: advertools (<0.14.0,>=0.13.2)
Requires-Dist: atoma (<0.0.18,>=0.0.17)
Requires-Dist: extruct (<0.17.0,>=0.16.0)
Requires-Dist: jsondiff (<3.0.0,>=2.0.0)
Requires-Dist: lxml (<5.0.0)
Requires-Dist: oaiharvest (<4.0.0,>=3.0.0)
Requires-Dist: pathos (<0.4.0,>=0.3.0)
Requires-Dist: prefect (<3.0.0,>=2.16.9)
Requires-Dist: progressbar2 (<5.0.0,>=4.4.2)
Requires-Dist: pydantic (<3.0.0,>=2.3.0)
Requires-Dist: pydantic-settings (<3.0.0,>=2.2.1)
Requires-Dist: pygit2 (<2.0.0,>=1.15.0)
Requires-Dist: pyld (<3.0.0,>=2.0.3)
Requires-Dist: pyoai (==2.5.0)
Requires-Dist: pyshacl (<0.26.0,>=0.25.0)
Requires-Dist: python-crontab (<4.0.0,>=3.0.0)
Requires-Dist: python-dateutil (<3.0.0,>=2.8.2)
Requires-Dist: rdflib (<7.0.0,>=6.2.0)
Requires-Dist: requests (<3.0.0,>=2.28.1)
Requires-Dist: shapely (<3.0.0,>=2.0.1)
Requires-Dist: typer[all] (<0.13.0,>=0.12.1)
Requires-Dist: vcrpy (<7.0.0,>=6.0.1)
Requires-Dist: wrapt (<2.0.0,>=1.15.0)
Description-Content-Type: text/markdown
[Description omitted; length: 4551 characters]

WHEEL

Wheel-Version: 1.0
Generator: poetry-core 1.9.0
Root-Is-Purelib: true
Tag: py3-none-any
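
The `Tag` value is the same compatibility triple that appears at the end of the wheel filename (Python tag, ABI tag, platform tag). As an illustration, the sketch below splits a wheel filename into its components; `parse_wheel_filename` is a hypothetical helper written for this page and does no validation beyond counting the fields.

```python
def parse_wheel_filename(filename):
    """Split a wheel filename into its dash-separated components.

    Expected layout:
    {name}-{version}(-{build})?-{python tag}-{abi tag}-{platform tag}.whl
    """
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel filename: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        name, version, python_tag, abi_tag, platform_tag = parts
        build = None  # the optional build tag is absent
    elif len(parts) == 6:
        name, version, build, python_tag, abi_tag, platform_tag = parts
    else:
        raise ValueError(f"unexpected number of fields in {filename!r}")
    return {
        "name": name,
        "version": version,
        "build": build,
        "python_tag": python_tag,
        "abi_tag": abi_tag,
        "platform_tag": platform_tag,
    }


info = parse_wheel_filename("data_harvesting-2.0.0-py3-none-any.whl")
print(info["python_tag"], info["abi_tag"], info["platform_tag"])
```

For this wheel the three tags are `py3`, `none` and `any`, which is consistent with `Root-Is-Purelib: true`: a pure-Python wheel with no ABI or platform restriction.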

RECORD

Path Digest Size
data_harvesting/__init__.py sha256=VJIOm9UOG5JUgeiOUiwfZAQ0NUlbpjjd5eQD9WMhFcg 2391
data_harvesting/aggregator.py sha256=LrMUZIOf_ft4_TtCaRFH_FzGXyo8cEmKAc0tuHQV_8Y 7156
data_harvesting/cli/__init__.py sha256=_RVfoP_ztJjMjq_GKM4tSFdvtPrmXTXHNgpkHfI6fA4 612
data_harvesting/cli/aggregator.py sha256=ATsDy-hC9_wb75zMc8IK0FmDyQMu87mDOuHGAoUcPL8 1129
data_harvesting/cli/cli.py sha256=0p2a8t2yY3ygGoUmpCb_KTyqsIjWIrHx7NYP3Fl6BQk 1567
data_harvesting/cli/converter.py sha256=dO49EktQgxpp_ujc6SEEkrli_idSWUNGscfCL1WM6co 5389
data_harvesting/cli/cron/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
data_harvesting/cli/cron/cron sha256=uHMe5QBLA1XNA6K7K26wSNw3EL8Y5W4BhfRUvB5e3ew 419
data_harvesting/cli/cron/setup.py sha256=ikHOs3LIoPFB2OUrsF_THR3JfCGXH0WFQYHN4AJ3MeE 3766
data_harvesting/cli/datapipeline.py sha256=2XHcx12u-d4tCqJjEKhBjzIsTZZZRC2wsJE2r_B0Iv8 5566
data_harvesting/cli/harvesters.py sha256=QEGfwfPvZ-ShzTVh83g6z7ru-Evw_g6xLh_rXT2kHgE 3034
data_harvesting/cli/indexer.py sha256=UQM7G2pz768mPkYERAv9V9jS6f4cHZM-bO_Wfxabx4I 3604
data_harvesting/cli/stats.py sha256=i3bUhm_kmKfnm6hyKlbFllpEmA_kTDdP8M7qBGdOySM 540
data_harvesting/cli/util.py sha256=zF9OEGjBsP_n_5FCewrlrkSu1kL9rpHat49N4Uu13AQ 6343
data_harvesting/configs/config.yaml sha256=jh87BpVtPipGBBrGDEKmEeNXJ0ZstO_mU44j1iD9S04 8029
data_harvesting/data_enrichment/SPARQL-update.rq sha256=uS0Y5plXV7l4JKtrNAqa4E5-k4wKZWkdiJjQCnPTK2c 1569
data_harvesting/data_enrichment/generate_queries/add_organizations.py sha256=jQDY5rEybKYMatNQdTGKJXE9H6f5kvmYX5xAvEn9qmo 1879
data_harvesting/data_enrichment/generate_queries/add_organizations.rd sha256=aWBCwI41RbIGc0Tug_-c9IC46Sp2gicYuxTzJkOAmy0 2968
data_harvesting/data_enrichment/generate_queries/multiple_org_names.json sha256=er2Rkgz_fG1og-J9ri2HVfdbvd4k5rJ8HiOLjVtoa20 42772
data_harvesting/data_enrichment/generate_queries/remove_redundant_single_org.py sha256=DSgbJ5AxGmbttfyfoFl9GrEncVATeBKro3kun19BiYM 2129
data_harvesting/data_enrichment/generate_queries/remove_redundant_single_org.rd sha256=rqO9GZOIjntlGc4OB-RabP04XdFRu5vAlSm4TrcpV2I 18263
data_harvesting/data_enrichment/generate_queries/single_org_names.json sha256=tD1qJzJNwjUw-jfvG3B10U6amONQWpQo9Yqu-qYPcww 18350
data_harvesting/data_enrichment/schema_infer_org.rd sha256=uZoOiYleFOeWFIXYpu9QVQcJF0BCoIuE-npBRO1iKro 275
data_harvesting/data_enrichment/schema_infer_org_id.rd sha256=IAktPM1absjE82mjlAd0cwfFf4L4N_b7lBYpnzBtzTg 352
data_harvesting/data_enrichment/schema_infer_organization.rd sha256=uZoOiYleFOeWFIXYpu9QVQcJF0BCoIuE-npBRO1iKro 275
data_harvesting/data_enrichment/schema_infer_person.rd sha256=Btj9P7F4N0EvENX-_jYz-nASg39Xq34UXetWhiInGJ4 245
data_harvesting/data_enrichment/schema_set_org_iri.rd sha256=2LGWUcyEDmFmmUIy2f0z3WlDmNOkNMWk7Jy2uvW_rmk 566
data_harvesting/data_model.py sha256=X_9Hp-BEFEc18hiX0UJAeP8wW1t8RV8UCNKoFkfKcOE 6361
data_harvesting/external_schema/README.md sha256=VwlfmTZQBYEhHgLFm_eLyy008UTrK1ikUMwr2wx89eE 341
data_harvesting/external_schema/codemeta.jsonld sha256=EuJq7bihebmEL9dVpIz5XH7FIj24UF1qJD-Q66nyV_M 4421
data_harvesting/external_schema/context_cache.yaml sha256=p1tKBB2_Z1ouejkXJzPoCNjR60zjOhiKkhkPyIk9_hU 224639
data_harvesting/external_schema/schema_org.jsonld sha256=Ua8mKc1jpVcInqFpoZOPOmYY4tum7o6NKV2dzqNra1I 1394579
data_harvesting/external_schema/schema_org_shacl.jsonld sha256=gMInc6h4uInWi4nQoFyxUeiZdqM4qNCOWE7vnK5xPoc 1366578
data_harvesting/harvester/__init__.py sha256=H1L4zO9qyj0h-X3wWNCiFQCBZatx4Xb6h6mYZt7wsWo 1647
data_harvesting/harvester/base.py sha256=TkFBX2MEtBjKZBD7K3vb7-M-yYTybreOUJih4VM-PiA 7689
data_harvesting/harvester/datacite.py sha256=Y_gLm-do1OciRUP-kd0TZihmKZTvHb9Xtgf3bxEWzKQ 14490
data_harvesting/harvester/feed.py sha256=nkwQ5LpRYPsLIfs7Qif9FRncItOBFrD8ot-6uEDlLVE 7251
data_harvesting/harvester/git.py sha256=g0qZ3mI_3o8D7CUS8HflUDC8hEHp_Rb0gRXeyRcv040 17064
data_harvesting/harvester/gitlabs.json sha256=bbEfXBTVfWZC1csiH3dRHHJbfIj3O-4V11JCi9MF0Mk 1750
data_harvesting/harvester/indico.py sha256=xIRsuOlqvy7CE0ROJEO_cmuZjV2t7q8FN71pzYNmCGk 6812
data_harvesting/harvester/oaipmh/__init__.py sha256=_NNetne2lDhxAocdvqYiaVDQwh_bsi3s7sU70uqS-6Q 673
data_harvesting/harvester/oaipmh/constants.py sha256=-K4S_HzPQtkufjST2pC4ku20eWVPobSI_28FSFqLH1E 2391
data_harvesting/harvester/oaipmh/convert_harvest.py sha256=nkuWEiJUoKGH8ZVzHTNSIaaenn0IiHdM8MUavj_NhXA 1723
data_harvesting/harvester/oaipmh/jsonldoutput.py sha256=Vr1gwq4QQqlKOqykJQoKodoieqF9er1xfKogRUZT4DE 9323
data_harvesting/harvester/oaipmh/oai.py sha256=fpu4ZCtaOG47KBjIWNNxoKRwTLNIJdss8RbUuVAdnhM 4862
data_harvesting/harvester/sitemap.py sha256=rQFBveGXdFGToHg3oSvEHaZd_ePkAge1fMo45lEii3I 11516
data_harvesting/indexer/README.md sha256=O9_1I22joIQY1ZBKpnVgoI6-7Wy2yBKpS5WtYx10BFE 17016
data_harvesting/indexer/UNSD.Methodology.csv sha256=wK1WToT620AWjZ11nh2RiLxPInlq6VvqFZGkfsijHJA 20270
data_harvesting/indexer/__init__.py sha256=H9Echt6ZC0VrFUOc-I25tKJzToQC54NNOYswFIzlaiI 707
data_harvesting/indexer/common.py sha256=c6EUly3ZtakQ085tNzA4eP0xQxVLKZ7ylztEbNz5gm8 363
data_harvesting/indexer/conversions.py sha256=VnDJTGFAbHJiyXk804VOuJz0UyFb7UXk81q3jApvXfs 7199
data_harvesting/indexer/indexer.py sha256=RYVrbhCOXezRkjX-qrPp8YlQL0g4RMLETR_shbtILCw 39745
data_harvesting/indexer/models.py sha256=Dp31y8U5bGWfFahm4MykhBNiRZwQf2YEJuBmrgdYAwI 639
data_harvesting/indexer/regions-clipped.geojson sha256=Yu93hDdpM0wTqi587-9Gt15OrEs7eL3wT_E9-CQErkU 4536
data_harvesting/indexer/regions.py sha256=CZIhl4Sjgkp6YK9QB06H7Mh7EL24_ih4xqN65zopDsA 2953
data_harvesting/indexer/test_utils.py sha256=MNeIGFzJBD_fE_bgrH7dmS94zWqGI_2QF9DvF-7k9pg 1728
data_harvesting/model_core.py sha256=Prjqy9sTxaubs2HQX2Epqua9gzZWpmoggrp0aRm0enM 8344
data_harvesting/pipeline/__init__.py sha256=M8fk2jA756qBMfiRQSXUce_P11Neinrk2cfkW5xgmnM 1373
data_harvesting/pipeline/aggregator.py sha256=LLU-Rau4V-3Qva5WnmL0FTw0nZhWhojz4beK2CMMoNs 1948
data_harvesting/pipeline/harvester.py sha256=21Ys0sBUayWWJW_xoazaTYId11FtaslNcXoibN0H83k 6036
data_harvesting/pipeline/indexer.py sha256=DaZeiUKyibcG5_iF3EuCOv8rk-4oLDBfD2c_fDMFJF4 2981
data_harvesting/pipeline/pipeline.py sha256=B4p-GZkGH7i0k1LrpsZBKZW7Z6a6uqEKIClrfYEqKas 4964
data_harvesting/pipeline/uploader.py sha256=CjBMY0ntWoW47kIO8fK48Gp_inifM4Y2pwFRoINJILo 2736
data_harvesting/prov_tracker.py sha256=snr8AR4yrzoG_gl9ZkriXMbFg5_LBWqnjldbZS4rmdU 12374
data_harvesting/rdfpatch.py sha256=wqOTDbuLEdld3PVnDbNfU3DDaWrIXC7hTvC4MnbLuxc 8690
data_harvesting/schemas/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
data_harvesting/schemas/mappings/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
data_harvesting/schemas/mappings/map_repo.py sha256=4-jdxP1FzYwA8N20l2doA7k99bV9Hgf4Ynvaa0aaKKM 13892
data_harvesting/stats/__init__.py sha256=qUoMkk7APRZ7N5sthI3RCm2rXE3yN0HjN3R396s_Gdc 246
data_harvesting/stats/config.py sha256=IS86qKjxe97FOCYnrxa0r2qPqOb84PMeb5vBNZYxg4A 1513
data_harvesting/stats/database/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
data_harvesting/stats/database/operations.py sha256=zM59GRkm2yd7IL7yVlq3JLg6s_QSJR2AuwFLDyvMdUY 1927
data_harvesting/stats/database/setup.py sha256=WKR-px9FmoVcbKSWfc6eH5q_XHhvb6bETzSJfl_6g38 556
data_harvesting/stats/deployment.py sha256=mOQNLzDG5NqtK4lHoMp1ZH_QLAcnP13kTYao84F_iP4 1674
data_harvesting/stats/models.py sha256=EDEK2_olFabWwZOaRhzS_-NHkpgk41ChFJkYFZZQS1g 1057
data_harvesting/stats/record/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
data_harvesting/stats/record/flow.py sha256=3AzMTsyKgraFSm88L3rQ99ooI5i4VYqfXSMTd_IhNz4 1219
data_harvesting/stats/record/utils.py sha256=LXYq5EOKvpu559zbj-xXGaLWyB2gP6_UXr-UjIwTpwM 1645
data_harvesting/stats/report/__init__.py sha256=aoCTFvVhUTaWRAl3jNMCj3D7TYXKBdETJ4clSCBo4aM 366
data_harvesting/stats/report/flow.py sha256=6hSPDgfWtY8mkO-Ycb8iMZF7_S2UyXyAGPtwfG0awWQ 2397
data_harvesting/stats/report/git.py sha256=K5Pw_EKMzal446ZOFotAmFy2wsqhl99jHgfKAN9tY6I 2362
data_harvesting/util/__init__.py sha256=RCNj_NQE1QjVf8nlVZAjTrKowLLAIIQWjaflIzzW1SU 749
data_harvesting/util/config.py sha256=J5CVyJpX_Ui4tB8a4mI8t_wOkMtLHBafetZeQNB0VFs 2271
data_harvesting/util/data_model_util.py sha256=iLQo7ocUGCus84sbt5IjG4-BbAz-Qc0Lsqj4WNhXX3A 13328
data_harvesting/util/data_operation.py sha256=hNX9NDJvzhfi2PL2T4VeIRlz2vAX6Iixk0IT3YhNevk 4711
data_harvesting/util/external_schemas.py sha256=qg5u2lsDxMXBaUOUPtUWUmoiWLQ_NoYqrpBS--Kfj00 1862
data_harvesting/util/json_ld_util.py sha256=NUESuvefFbHxLJimL7IOLELAS6LIKHQThIXDEyS46F4 14365
data_harvesting/util/json_util.py sha256=ETROH2VvApEn-6KukE9rP1cIZWZSltDPrFUF8Rf28-s 7919
data_harvesting/util/map_ror.py sha256=VW5uXxQj06aOoxXGXAl5FRnj7k1cvbMB1Q1_sop-h3Q 16808
data_harvesting/util/rdf_util.py sha256=q7qOaYhktm7N9GhuwDnrzYMiyA9RWIO_0sYGTsbeeMk 3573
data_harvesting/util/sparql_util.py sha256=6zPXAvISAZDrXdfeQpRRtzwNeOu_S8RGMcQ9v_4mou4 2076
data_harvesting/util/url_util.py sha256=cJHB3THf4GX1468Tqo-EKTzMuJkJ6hnXmrR9YLKoItw 4065
data_harvesting-2.0.0.dist-info/LICENSES/CC0-1.0.txt sha256=ogEPNDSH0_dhiv_lT3ifVIdgIzHAqNA_SemnxUfPBJk 7048
data_harvesting-2.0.0.dist-info/LICENSES/MIT.txt sha256=09s9N-gfu0PnDRzNMCPYWm64BTmLApP98kv2nhpr2Vw 1287
data_harvesting-2.0.0.dist-info/METADATA sha256=pKbVGH6sFcJxfcZjjIcOnUflSQLEphF5d9S9a7l0FUg 7272
data_harvesting-2.0.0.dist-info/WHEEL sha256=sP946D7jFCHeNz5Iq4fL4Lu-PrWrFsgfLXbbkciIZwg 88
data_harvesting-2.0.0.dist-info/entry_points.txt sha256=c9HWnkDfWZcFqjF7Ks6cpbtpnzyh-s7mPd_M4uTralE 58
data_harvesting-2.0.0.dist-info/RECORD
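
Each digest in the listing above is a SHA-256 hash encoded as URL-safe base64 with the trailing `=` padding removed. The sketch below reproduces that encoding and parses one row as displayed on this page. It assumes the space-separated layout shown here (the RECORD file inside the wheel itself is comma-separated), and the names `record_digest` and `parse_listing_row` are hypothetical.

```python
import base64
import hashlib


def record_digest(data: bytes) -> str:
    """Return the RECORD-style digest string for the given file contents."""
    raw = hashlib.sha256(data).digest()
    encoded = base64.urlsafe_b64encode(raw).rstrip(b"=").decode("ascii")
    return f"sha256={encoded}"


def parse_listing_row(row: str):
    """Split a 'path digest size' row as displayed above.

    Rows without digest and size (such as the RECORD entry itself)
    are returned as (path, None, None).
    """
    parts = row.rsplit(" ", 2)
    if len(parts) != 3:
        return row, None, None
    path, digest, size = parts
    return path, digest, int(size)


# The empty __init__.py files all share the digest of zero bytes of input.
row = (
    "data_harvesting/schemas/__init__.py "
    "sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0"
)
path, digest, size = parse_listing_row(row)
print(digest == record_digest(b""))
```

This also explains why every zero-size `__init__.py` in the listing carries the same digest, and why `schema_infer_org.rd` and `schema_infer_organization.rd` (identical digest and size) have identical contents.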

entry_points.txt

hmc-unhide = data_harvesting.cli.cli:cli
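
The entry above follows the usual `name = module:attribute` form: installing the wheel creates an `hmc-unhide` command that imports `data_harvesting.cli.cli` and calls its `cli` object. The sketch below shows, under that assumption, how such a line can be parsed and resolved. `parse_entry_point` and `load_target` are hypothetical helpers, and the resolution step is demonstrated on a standard-library target so that it runs without the package installed.

```python
import importlib


def parse_entry_point(line):
    """Split 'name = module:attr' into (name, module, attr)."""
    name, _, target = line.partition("=")
    module, _, attr = target.strip().partition(":")
    return name.strip(), module.strip(), attr.strip()


def load_target(module, attr):
    """Import the module and walk the (possibly dotted) attribute path."""
    obj = importlib.import_module(module)
    for part in attr.split("."):
        obj = getattr(obj, part)
    return obj


name, module, attr = parse_entry_point("hmc-unhide = data_harvesting.cli.cli:cli")
print(name, module, attr)

# Resolution shown on a stdlib target; with the wheel installed,
# load_target(module, attr) would return the CLI object instead.
join = load_target("os.path", "join")
```

With the package installed, the same lookup is available through `importlib.metadata.entry_points()`, which reads `entry_points.txt` from the installed distribution.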