Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
1190 commits
Select commit Hold shift + click to select a range
2ae466f
ok: Update surya-bench.yaml
cstner Dec 1, 2025
b085ebf
Merge branch 'main' into main
cstner Dec 1, 2025
3c50fa9
ok: Update planette_era5_reanalysis.yaml
cstner Dec 1, 2025
06dbce2
ok: Merge pull request #2957 from AodhanSweeney/main
cstner Dec 1, 2025
aaff94d
Merge branch 'main' into main
berylrab Dec 1, 2025
16c8c7a
ok: Update hprc-epigenome.yaml
berylrab Dec 1, 2025
9b13c3e
ok: Merge pull request #2932 from lidaof/main
berylrab Dec 1, 2025
7c4ef04
Merge branch 'main' into main
berylrab Dec 1, 2025
cb8eae9
Adding resource details.
Arun-George-Zachariah Dec 1, 2025
d4a7662
Merge branch 'main' into dynamical-hrrr-gfs-ens
cstner Dec 1, 2025
cadf9dc
ok: Update dynamical-ecmwf-ifs-ens.yaml
cstner Dec 1, 2025
4d0fe33
ok: Update dynamical-noaa-hrrr.yaml
cstner Dec 1, 2025
f512dfb
ok: Update dynamical-ecmwf-ifs-ens.yaml
cstner Dec 1, 2025
d6bc081
ok: Update dynamical-noaa-gfs.yaml
cstner Dec 1, 2025
090fff6
ok: Update dynamical-noaa-hrrr.yaml
cstner Dec 1, 2025
27ac4d0
ok: Update dynamical-noaa-gfs.yaml
cstner Dec 1, 2025
a2074a0
ok: Update dynamical-ecmwf-ifs-ens.yaml
cstner Dec 1, 2025
4c1584c
ok: Update dynamical-noaa-gfs.yaml
cstner Dec 1, 2025
656952f
ok: Update dynamical-noaa-hrrr.yaml
cstner Dec 2, 2025
3291cb5
ok: Update dynamical-noaa-gfs.yaml
cstner Dec 2, 2025
90b7973
ok: Update dynamical-ecmwf-ifs-ens.yaml
cstner Dec 2, 2025
43a745f
Updating Notebook URL.
Arun-George-Zachariah Dec 2, 2025
e867aef
Updating the tutorial repository.
Arun-George-Zachariah Dec 2, 2025
e261939
Adding webpaget to request access.
Arun-George-Zachariah Dec 2, 2025
a53c0f7
Add `URL` for tutorials, SNS topic, and clean up description whitespace
aldenks Dec 2, 2025
271e502
ok: Update asl_1000.yaml
berylrab Dec 2, 2025
8774ff7
ok: Update asl_1000.yaml
berylrab Dec 2, 2025
4d70be6
ok: Update asl_1000.yaml
berylrab Dec 2, 2025
4e0e170
Updating the license.
Arun-George-Zachariah Dec 2, 2025
53deb7e
ok: Update dynamical-ecmwf-ifs-ens.yaml
cstner Dec 2, 2025
0f057e5
ok: Update asl_1000.yaml
berylrab Dec 2, 2025
e3082bd
ok: Merge pull request #2947 from Arun-George-Zachariah/main
berylrab Dec 2, 2025
0a3e13d
Merge branch 'main' into dynamical-hrrr-gfs-ens
cstner Dec 2, 2025
ea61a2d
ok: Update dynamical-ecmwf-ifs-ens.yaml
cstner Dec 2, 2025
412575f
ok: Merge pull request #2955 from dynamical-org/dynamical-hrrr-gfs-ens
cstner Dec 2, 2025
1c75fe6
Fix links to dynamical.org model documentation pages
aldenks Dec 2, 2025
5159b99
ok: Update dynamical-ecmwf-ifs-ens.yaml
cstner Dec 2, 2025
42dced9
ok: Merge pull request #2959 from dynamical-org/dynamical-model-docs-…
cstner Dec 2, 2025
7e00721
add open targets dataset yaml
remo87 Dec 3, 2025
6d7b9e1
update pub url to doi
remo87 Dec 3, 2025
81337f1
update pub title
remo87 Dec 3, 2025
6ae54a4
Merge branch 'main' into add-alliance-genome-resources
berylrab Dec 3, 2025
ef9fde2
Merge branch 'main' into main
berylrab Dec 3, 2025
8d12c2a
Update igvf-consortium.yaml
hitz Dec 3, 2025
53139dc
ok: Update igvf-consortium.yaml
berylrab Dec 3, 2025
78b29ea
Merge branch 'main' into igvf-consortium
berylrab Dec 3, 2025
d069d54
search fix - cannot use div tags in description body
cstner Dec 4, 2025
be7a86a
ok: Update dynamical-ecmwf-ifs-ens.yaml
cstner Dec 4, 2025
34e2576
ok: Merge pull request #2963 from awslabs/search-fix
cstner Dec 4, 2025
63b1cb7
Add US Tidal dataset information to marine-energy-data.yaml
alowney Dec 4, 2025
f11d8ff
ok: Update marine-energy-data.yaml
cstner Dec 4, 2025
24df182
ok: Merge pull request #2964 from alowney/patch-35
cstner Dec 4, 2025
d95be7a
Merge branch 'main' into add-flab
berylrab Dec 4, 2025
e61cf7b
ok: Update igvf-consortium.yaml
berylrab Dec 4, 2025
1feff65
Merge branch 'main' into igvf-consortium
berylrab Dec 4, 2025
378de48
ok: Update igvf-consortium.yaml
berylrab Dec 4, 2025
d3e02e8
ok: Update igvf-consortium.yaml
berylrab Dec 4, 2025
87ea8a5
ok: Merge pull request #2859 from IGVF-DACC/igvf-consortium
berylrab Dec 4, 2025
03b914d
Merge branch 'main' into add-flab
berylrab Dec 4, 2025
0d39fb0
ok: Update flab.yaml
berylrab Dec 4, 2025
6f510c2
ok: Update flab.yaml
berylrab Dec 4, 2025
8bf96f6
ok: Merge pull request #2836 from MichaelChungyoun/add-flab
berylrab Dec 4, 2025
7ff5193
Update bucket name from mod-datadumps to alliance-genome-downloads
christabone Dec 5, 2025
e3e04fe
Merge branch 'main' into add-alliance-genome-resources
berylrab Dec 5, 2025
f575817
feat: Add new tutorial details for Common Crawl dataset
wumpus Dec 7, 2025
d47ad65
Merge branch 'main' into main
japan-pointcloud Dec 8, 2025
da6e8ce
Merge branch 'main' into main
kanagawa-pointcloud Dec 8, 2025
7d3a9b2
ok: ready to merge
pschmied Dec 8, 2025
c6df1d0
Remove duplicate Tutorials section
pschmied Dec 8, 2025
453c86a
ok: to merge
pschmied Dec 8, 2025
5c12df2
Merge pull request #2965 from wumpus/patch-1
pschmied Dec 8, 2025
6932612
Merge branch 'main' into main
cstner Dec 8, 2025
0d4538f
ok: Update kanagawa_pointcloud.yaml
cstner Dec 8, 2025
8a406cd
Merge branch 'main' into main
cstner Dec 8, 2025
178cc1e
Add SNS topic and browse.
xhagrg Dec 8, 2025
1babebf
Update browse url.
xhagrg Dec 8, 2025
a0e5ac2
Merge branch 'main' into dataset-surya_bench
xhagrg Dec 8, 2025
bee02c0
ok: Update surya-bench.yaml
cstner Dec 8, 2025
ffb61aa
ok: Merge pull request #2708 from NASA-IMPACT/dataset-surya_bench
cstner Dec 8, 2025
84272c1
Update ManagedBy field in gdr-data-lake.yaml
alowney Dec 8, 2025
c9a0428
Update ManagedBy field in oedi-data-lake.yaml
alowney Dec 8, 2025
66274b5
Update ManagedBy field in marine-energy-data.yaml
alowney Dec 8, 2025
9812e3a
Update ManagedBy field
alowney Dec 8, 2025
6f542c8
Update ManagedBy field in dsgrid.yaml
alowney Dec 8, 2025
03dde62
Update ManagedBy field in nrel-pds-ncdb.yaml
alowney Dec 8, 2025
2c56442
Update managed by field in nrel-pds-nsrdb.yaml
alowney Dec 8, 2025
b54568f
Update ManagedBy field in nrel-pds-porotomo
alowney Dec 8, 2025
df75d9b
Update ManagedBy field in nrel-pds-sup3rcc.yaml
alowney Dec 8, 2025
1135313
Update ManagedBy field in windai.yaml
alowney Dec 8, 2025
5af50c2
Update ManagedBy field in nrel-pds-wtk.yaml
alowney Dec 8, 2025
9e7b92c
ok: Update nrel-pds-wtk.yaml
cstner Dec 9, 2025
cd861e7
ok: Merge pull request #2977 from alowney/patch-46
cstner Dec 9, 2025
a210c23
Merge branch 'main' into patch-45
cstner Dec 9, 2025
6718175
ok: Update nrel-pds-windai.yaml
cstner Dec 9, 2025
21c489e
ok: Merge pull request #2976 from alowney/patch-45
cstner Dec 9, 2025
1338ed4
Merge branch 'main' into patch-44
cstner Dec 9, 2025
8ef7962
ok: Update nrel-pds-sup3rcc.yaml
cstner Dec 9, 2025
2924fcd
ok: Merge pull request #2975 from alowney/patch-44
cstner Dec 9, 2025
e5c0400
Merge branch 'main' into patch-43
cstner Dec 9, 2025
b57d818
ok: Update nrel-pds-porotomo.yaml
cstner Dec 9, 2025
7562d68
ok: Merge pull request #2974 from alowney/patch-43
cstner Dec 9, 2025
49059f6
Merge branch 'main' into patch-42
cstner Dec 9, 2025
4780959
ok: Update nrel-pds-nsrdb.yaml
cstner Dec 9, 2025
c435679
Update kanagawa_pointcloud.yaml
kanagawa-pointcloud Dec 9, 2025
43cb1fd
ok: Merge pull request #2973 from alowney/patch-42
cstner Dec 9, 2025
35d988e
Merge branch 'main' into patch-41
cstner Dec 9, 2025
148faff
ok: Update nrel-pds-ncdb.yaml
cstner Dec 9, 2025
938081c
ok: Merge pull request #2972 from alowney/patch-41
cstner Dec 9, 2025
25d55b6
Merge branch 'main' into main
kanagawa-pointcloud Dec 9, 2025
e393580
Merge branch 'main' into patch-40
cstner Dec 9, 2025
ea5b9b3
ok: Update nrel-pds-dsgrid.yaml
cstner Dec 9, 2025
ba2615d
ok: Merge pull request #2971 from alowney/patch-40
cstner Dec 9, 2025
51a278e
Merge branch 'main' into patch-39
cstner Dec 9, 2025
fcb282a
ok: Update nrel-pds-building-stock.yaml
cstner Dec 9, 2025
b04d012
ok: Merge pull request #2970 from alowney/patch-39
cstner Dec 9, 2025
aea6b62
Merge branch 'main' into patch-38
cstner Dec 9, 2025
f70f194
ok: Update marine-energy-data.yaml
cstner Dec 9, 2025
4ec847e
ok: Merge pull request #2969 from alowney/patch-38
cstner Dec 9, 2025
c4b19ab
Merge branch 'main' into patch-37
cstner Dec 9, 2025
ccf9559
ok: Update oedi-data-lake.yaml
cstner Dec 9, 2025
5df6b14
ok: Merge pull request #2968 from alowney/patch-37
cstner Dec 9, 2025
a99af0a
Merge branch 'main' into patch-36
cstner Dec 9, 2025
b2cde85
ok: Update gdr-data-lake.yaml
cstner Dec 9, 2025
8855570
ok: Merge pull request #2967 from alowney/patch-36
cstner Dec 9, 2025
a50dd95
Update japan_pointcloud.yaml
japan-pointcloud Dec 9, 2025
ffb9385
Merge branch 'main' into main
japan-pointcloud Dec 9, 2025
2ea6285
updated SNS queue arn
mwielocha Dec 9, 2025
4d807ec
Merge branch 'nuview-state-opendata' of github.com:s22s/open-data-reg…
mwielocha Dec 9, 2025
437c43c
Merge branch 'main' into nuview-state-opendata
mwielocha Dec 9, 2025
9a5d5cb
Merge branch 'main' into add-alliance-genome-resources
berylrab Dec 9, 2025
627eb05
ok: Update alliance-genome-resources.yaml
berylrab Dec 9, 2025
0b1215a
ok: Update alliance-genome-resources.yaml
berylrab Dec 9, 2025
9b983e6
ok: Merge pull request #2910 from christabone/add-alliance-genome-res…
berylrab Dec 9, 2025
3ec34f0
Merge branch 'main' into main
cstner Dec 9, 2025
81e7c79
ok: Update japan_pointcloud.yaml
cstner Dec 9, 2025
90d0769
ok: Merge pull request #2948 from japan-pointcloud/main
cstner Dec 9, 2025
1a7a2eb
Merge branch 'main' into main
cstner Dec 9, 2025
6a2fecd
ok: Update kanagawa_pointcloud.yaml
cstner Dec 9, 2025
0ac2b27
ok: Merge pull request #2935 from kanagawa-pointcloud/main
cstner Dec 9, 2025
5719c34
Merge branch 'main' into nuview-state-opendata
cstner Dec 9, 2025
cd64989
ok: Update nuview-state.yaml
cstner Dec 9, 2025
f46b962
removed RequesterPays
mwielocha Dec 9, 2025
8855a6b
Merge branch 'nuview-state-opendata' of github.com:s22s/open-data-reg…
mwielocha Dec 9, 2025
82f240f
Update metadata
adamltyson Dec 10, 2025
e6b7638
ok: Update nuview-state.yaml
cstner Dec 11, 2025
9ef4fc6
ok: Update nuview-state.yaml
cstner Dec 11, 2025
5695404
ok: Merge pull request #2943 from s22s/nuview-state-opendata
cstner Dec 11, 2025
5694a30
Merge branch 'main' into add-brainglobe
berylrab Dec 11, 2025
9bf5762
ok: Update brainglobe.yaml
berylrab Dec 11, 2025
9574504
ok: Update brainglobe.yaml
berylrab Dec 11, 2025
a4541b2
Add files via upload
OpsCCRS Dec 12, 2025
812cfc0
Add Open Human Genome Library (OpenHGL)
lh3 Dec 14, 2025
e4557d8
Add SNS topic
adamltyson Dec 15, 2025
aa52d30
ok: Update brainglobe.yaml
berylrab Dec 15, 2025
5b43d51
ok: Update brainglobe.yaml
berylrab Dec 15, 2025
26329b8
ok: Merge pull request #2913 from brainglobe/add-brainglobe
berylrab Dec 15, 2025
3cfd830
ok: Update uniprot.yaml
berylrab Dec 15, 2025
60c3b68
ok: Merge pull request #2982 from awslabs/berylrab-patch-4
berylrab Dec 15, 2025
0b7130b
Merge branch 'main' into main
cstner Dec 15, 2025
2733cd2
ok: Update CCRSMODISAlbedo.yaml
cstner Dec 15, 2025
70f485f
ok: Update CCRSMODISAlbedo.yml
cstner Dec 15, 2025
d77e12f
ok: Update CCRSMODISAlbedo.yml
cstner Dec 15, 2025
e96b3fe
Delete CCRSMODISAlbedo.yml
OpsCCRS Dec 16, 2025
9b866ea
Add files via upload
OpsCCRS Dec 16, 2025
ff24287
ok: Update CCRSMODISAlbedo.yml
cstner Dec 16, 2025
1c0dc68
ok: Update and rename CCRSMODISAlbedo.yml to CCRSMODISAlbedo.yaml
cstner Dec 16, 2025
0a3154d
ok: Update and rename CCRSMODISAlbedo.yaml to ccrsmodisalbedo.yaml
cstner Dec 16, 2025
9da0319
ok: Update ccrsmodisalbedo.yaml
cstner Dec 16, 2025
55f1124
Add files via upload
OpsCCRS Dec 16, 2025
6844feb
Updating access URL
Arun-George-Zachariah Dec 16, 2025
25d05bc
Delete datasets/CCRSMODISAlbedo.yml
OpsCCRS Dec 17, 2025
939180e
Add files via upload
OpsCCRS Dec 17, 2025
49a4473
Delete datasets/ccrsmodisalbedo.yml
OpsCCRS Dec 17, 2025
f2e45d9
Update ccrsmodisalbedo.yaml
OpsCCRS Dec 17, 2025
0fdf639
ok: Update ccrsmodisalbedo.yaml
cstner Dec 18, 2025
bbe8d21
Merge branch 'main' into main
berylrab Dec 18, 2025
4a0089e
ok: Update ccrsmodisalbedo.yaml
cstner Dec 18, 2025
0600466
ok: Update asl_1000.yaml
berylrab Dec 18, 2025
77ec13d
ok: Merge pull request #2983 from Arun-George-Zachariah/main
berylrab Dec 18, 2025
35d708a
Update ccrsmodisalbedo.yaml
OpsCCRS Dec 18, 2025
16b97f3
ok: Update ccrsmodisalbedo.yaml
cstner Dec 18, 2025
3e5339a
Merge branch 'main' into main
cstner Dec 18, 2025
d55dea4
ok: Update ccrsmodisalbedo.yaml
cstner Dec 18, 2025
8f3e2e1
ok: Merge pull request #2940 from OpsCCRS/main
cstner Dec 18, 2025
74e704c
Add SNS topic resource
lh3 Dec 19, 2025
d62a4b4
ok: Update openhgl.yaml
berylrab Dec 22, 2025
97b1999
Merge branch 'main' into OpenHGL
berylrab Dec 22, 2025
7c3290b
ok: Update openhgl.yaml
berylrab Dec 22, 2025
ed1e4d2
ok: Merge pull request #2980 from lh3/OpenHGL
berylrab Dec 22, 2025
48b2660
Add S3 bucket and SNS topic for salk-aging-mouse-brain-epigeneti
bryanatsalk Dec 22, 2025
e2496cc
ok: Update salk-aging-mouse-brain-epigeneti.yaml
berylrab Dec 23, 2025
4971ebd
Merge branch 'main' into main
berylrab Dec 23, 2025
b88a1ca
ok: Update salk-aging-mouse-brain-epigeneti.yaml
berylrab Dec 23, 2025
04fbc29
ok: Update salk-aging-mouse-brain-epigeneti.yaml
berylrab Dec 23, 2025
80628b3
ok: Merge pull request #2949 from bjnielsen/main
berylrab Dec 23, 2025
de00578
move frag-struc.yaml to the datasets directory
yuukiiwa Dec 30, 2025
d7be310
Merge branch 'main' into add_dataset
berylrab Jan 2, 2026
b43d3c1
Update frag-struc.yaml
berylrab Jan 2, 2026
116ff88
ok: Update frag-struc.yaml
berylrab Jan 2, 2026
9ed2ba2
ok: Update frag-struc.yaml
berylrab Jan 2, 2026
17dc647
ok: Update frag-struc.yaml
berylrab Jan 2, 2026
aab3288
ok: Merge pull request #2988 from yuukiiwa/add_dataset
berylrab Jan 2, 2026
7046b8a
add notebook and update arn
remo87 Jan 7, 2026
b24e357
Add crescent_dunes dataset and update links
alowney Jan 7, 2026
0965357
Merge branch 'main' into main
berylrab Jan 7, 2026
e02fb0a
ok: Update tags.yaml
berylrab Jan 7, 2026
71e1328
ok: Merge pull request #2990 from awslabs/berylrab-patch-5
berylrab Jan 7, 2026
9640d6a
Add NOAA S-104 Water Level Data specification
kszura Jan 7, 2026
9eb11d6
ok: Update noaa-s104.yaml
cstner Jan 7, 2026
5aa23a7
ok: Update noaa-s104.yaml
cstner Jan 7, 2026
9f3371e
ok: Update noaa-s104.yaml
cstner Jan 7, 2026
e363204
ok: Update noaa-s104.yaml
cstner Jan 7, 2026
d04a611
ok: Merge pull request #2992 from kszura/patch-13
cstner Jan 7, 2026
fd43f07
delete new ot file found duplicated file
remo87 Jan 8, 2026
cf61f57
Merge branch 'main' of https://github.com/remo87/open-data-registry
remo87 Jan 8, 2026
964ba05
add aditional information
remo87 Jan 8, 2026
6fc4377
Add updated Alaska and West Coast datasets
alowney Jan 8, 2026
a55fe59
ok: Create anvilproject.yaml
berylrab Jan 8, 2026
1b82996
ok: Update anvilproject.yaml
berylrab Jan 8, 2026
926fa71
ok: Merge pull request #2997 from awslabs/berylrab-patch-5
berylrab Jan 8, 2026
ff59d83
Merge branch 'main' into patch-48
cstner Jan 8, 2026
b49e5ed
ok: Update wpto-pds-us-wave.yaml
cstner Jan 8, 2026
251ca53
ok: Merge pull request #2996 from alowney/patch-48
cstner Jan 8, 2026
a500e98
update tags
remo87 Jan 9, 2026
4510e29
ok: Update opentargets.yaml
berylrab Jan 9, 2026
d7c2685
Merge branch 'main' into main
berylrab Jan 9, 2026
2232002
ok: Update opentargets.yaml
berylrab Jan 9, 2026
e6c14c0
ok: Update opentargets.yaml
berylrab Jan 9, 2026
f981016
ok: Update opentargets.yaml
berylrab Jan 9, 2026
e390cf7
ok: Update opentargets.yaml
berylrab Jan 9, 2026
f328b51
ok: Merge pull request #2961 from remo87/main
berylrab Jan 9, 2026
af5d5bb
Create smaht.yaml
berylrab Jan 9, 2026
3670852
Pointing at some new services
willmacs Jan 9, 2026
53b2ee9
Merge branch 'awslabs:main' into main
willmacs Jan 9, 2026
956a392
ok: Update smaht.yaml
berylrab Jan 9, 2026
5dcb0d5
ok: Update smaht.yaml
berylrab Jan 9, 2026
03ea9f5
ok: Merge pull request #2998 from awslabs/berylrab-patch-5
berylrab Jan 9, 2026
b0a28e7
ok: Update smaht.yaml
berylrab Jan 9, 2026
043b4c3
ok: Merge pull request #3000 from awslabs/berylrab-patch-6
berylrab Jan 9, 2026
f2f5148
Update noaa-s104.yaml
kszura Jan 12, 2026
5ccb0de
ok: Update noaa-s104.yaml
cstner Jan 12, 2026
0dde2dd
ok: Merge pull request #3002 from kszura/patch-14
cstner Jan 12, 2026
9d9428e
Merge branch 'main' into main
cstner Jan 12, 2026
843331b
ok: Update rcm-ceos-ard.yaml
cstner Jan 12, 2026
e1b1381
ok: Merge pull request #2999 from willmacs/main
cstner Jan 12, 2026
f78c60e
Merge branch 'main' into patch-47
cstner Jan 12, 2026
84e2015
ok: Update oedi-data-lake.yaml
cstner Jan 12, 2026
6d7945f
ok: Merge pull request #2989 from alowney/patch-47
cstner Jan 12, 2026
b7ede6e
adding users and editing data entry
msiron-entalpic Jan 13, 2026
8fb9fec
Merge branch 'main' into lemat-rho-yaml
msiron-entalpic Jan 13, 2026
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
4 changes: 4 additions & 0 deletions datasets/3kricegenome.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,10 @@ Documentation: https://github.com/awslabs/open-data-docs/tree/main/docs/3kricege
Contact: http://iric.irri.org/contact-us
ManagedBy: '[International Rice Research Institute](https://www.irri.org/)'
UpdateFrequency: Not updated
Collabs:
ASDI:
Tags:
- agriculture
Tags:
- agriculture
- food security
Expand Down
24 changes: 24 additions & 0 deletions datasets/aef-source.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,24 @@
Name: Google Satellite Embedding V1
Description: COG (Cloud-Optimized GeoTIFF) files that together contain the AlphaEarth Foundations annual Satellite Embedding dataset. It contains the annual embeddings for the years from 2018 to 2024, inclusive.
Documentation: https://source.coop/tge-labs/aef
Contact: https://cloudnativegeo.org/join
ManagedBy: "[Source Cooperative](https://source.coop/)"
UpdateFrequency: As new data versions become available
Tags:
- aws-pds
- machine learning
- satellite imagery
- aerial imagery
- earth observation
- imaging
License: CC-BY 4.0
Citation: "The AlphaEarth Foundations Satellite Embedding dataset is produced by Google and Google DeepMind."
Resources:
- Description: Google Satellite Embedding V1
ARN: arn:aws:s3:::us-west-2.opendata.source.coop/tge-labs/aef
Region: us-west-2
Type: S3 Bucket
Explore:
- '[Browse Dataset](https://source.coop/tge-labs/aef/)'
ADXCategories:
- Environmental Data
4 changes: 4 additions & 0 deletions datasets/africa-field-boundary-labels.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -13,6 +13,10 @@ Documentation: Information on the primary dataset can be found [here](https://gi
Contact: airg@clarku.edu
ManagedBy: "[The Agricultural Impacts Research Group](https://agroimpacts.info/)"
UpdateFrequency: "Updated versions of the dataset are added as they are developed"
Collabs:
ASDI:
Tags:
- agriculture
Tags:
- agriculture
- machine learning
Expand Down
4 changes: 4 additions & 0 deletions datasets/ag-loam.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -9,6 +9,10 @@ Documentation: https://github.com/UCR-Robotics/AG-LOAM
Contact: Hanzhe Teng (hteng007@ucr.edu), Konstantinos Karydis (kkarydis@ece.ucr.edu)
ManagedBy: "[Autonomous Robots and Control Systems Lab](https://sites.google.com/view/arcs-lab)"
UpdateFrequency: NA
Collabs:
ASDI:
Tags:
- agriculture
Tags:
- aws-pds
- robotics
Expand Down
37 changes: 37 additions & 0 deletions datasets/ai3.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,37 @@
Name: AI3 Protein-Ligand Binding Affinity Dataset
Description: >
The rapid advancement of computing technologies, particularly artificial intelligence (AI), has revolutionized various domains, including drug discovery. Curated datasets are crucial for developing reliable, generalizable, and accurate models for practical applications. Generating experimental data on a large scale is an expensive and arduous process. In domains such as medical diagnostics where real-life data is hard to obtain, synthetic data has been shown to be extremely valuable. We, teams from IIIT Hyderabad, Intel, AWS, and Insilico Medicine, have performed physics-based calculations (molecular dynamics simulations) on about 20,000 protein-ligand complexes. The dataset comprises molecular dynamics snapshots, binding affinities calculated using the MM-PBSA method, and individual energy components, including electrostatic and van der Waals interactions. DatasetFileFormats essentially incorporate i. 3D coordinates of the protein-ligand complexes (pdb) in tar.gz files, and ii. CSV files containing the energy data. DatasetUsages are on i. ML scoring function for predicting binding affinities of given protein-ligand complexes, ii. Classification models for predicting correct binding poses of ligands, iii. identification of cryptic binding pockets, and iv. optimization of binding features by exploiting the individual components of the energy (experimental data has only the total binding affinity). Further, the novelty of the dataset highlights the fact that existing AI/ML training datasets lack dynamic data and are inherently biased. Further, binding affinity data existing in the literature are obtained from different experimental protocols. Therefore, this dataset has been uniquely created (from the same computational protocols) followed by free energy calculations with molecular dynamics (MD) simulations. The dynamic data-enriched protein-ligand coordinates can be used to effectively train convolutional neural network-based regression models for more accurate binding affinity prediction.
Documentation: https://github.com/devalab/AI3
Contact: devalab@iiit.ac.in
ManagedBy: International Institute of Information Technology Hyderabad
UpdateFrequency: Not updated
Tags:
- pharmaceutical
- simulations
- health
- life sciences
- machine learning
- protein
- molecular dynamics
- aws-pds
License: https://devalab.in/AI3.html
Resources:
- Description: ai3data bucket includes coordinates and the energetics of ~20,000 protein-ligand binding affinity datasets. The subfolders of ai3data bucket consist of Version 1, Version2 and Version 3. Version1 contains the total Size of 10.4 GiB (Initial structure of the protein-ligand complex and the average binding affinities along with average energy components). Version2 contains the total Size of 1.2 TiB (Five trajectories of protein-ligand complex (200 snapshots in all) and the closest two water molecules for each of the protein-ligand complex, and the time series of the binding affinities along with average energy components). Version3 contains the total Size of 10.7 TiB (Five trajectories of completely solvated protein-ligand complex (200 snapshots in all), and the time series of binding affinities along with average energy components).
ARN: arn:aws:s3:::ai3data
Region: us-east-1
Type: S3 Bucket
DataAtWork:
Tutorials:
- Title: "AI3: Protein-Ligand Binding Affinity Dataset"
URL: https://github.com/devalab/AI3
AuthorName: Deva Priyakumar Lab
AuthorURL: https://github.com/devalab
Publications:
- Title: "PLAS-5k: Dataset of Protein-Ligand Affinities from Molecular Dynamics for Machine Learning Applications"
URL: https://www.nature.com/articles/s41597-022-01631-9
AuthorName: U. Deva Priyakumar
AuthorURL: https://devalab.in/
- Title: "PLAS-20k: Extended Dataset of Protein-Ligand Affinities from MD Simulations for Machine Learning Applications"
URL: https://www.nature.com/articles/s41597-023-02872-y
AuthorName: U. Deva Priyakumar
AuthorURL: https://devalab.in
46 changes: 46 additions & 0 deletions datasets/allen-hmba-releases.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,46 @@
Name: Human and Mammalian Brain Atlas
Description:
Human and Mammalian Brain Atlas (HMBA) is a major atlas of the BRAIN Initiative Cell Atlas Network (BICAN) that proposes to establish a comprehensive,
highly granular cell atlas in complete adult human, macaque, and marmoset brains that links brain structure, function and cellular architecture.
Release artifacts have been made available in this OpenData bucket to enable utilization along with their paper publications by the neuroscience community.
Documentation: https://portal.brain-map.org/explore/hmba
Contact: awspds@alleninstitute.org
ManagedBy: "[Allen Institute](http://www.alleninstitute.org/)"
UpdateFrequency: Never
Tags:
- aws-pds
- biology
- gene expression
- neurobiology
- life sciences
- single-cell transcriptomics
- Mus musculus
- Homo sapiens
- non-human primate
License: http://www.alleninstitute.org/legal/terms-use/
Citation:
Resources:
- Description: Project data files in a public bucket
ARN: arn:aws:s3:::allen-hmba-releases
Region: us-west-2
Type: S3 Bucket
DataAtWork:
Tutorials:
- Title: Human-Mammalian Brain - Basal Ganglia - Data
URL: https://alleninstitute.github.io/abc_atlas_access/descriptions/HMBA-BG_dataset.html
AuthorName: Allen Institute for Brain Science
AuthorURL: www.alleninstitute.org
- Title: Human-Mammalian Brain - CCF Book
URL: https://alleninstitute.github.io/CCF-MAP/
AuthorName: Allen Institute for Brain Science
AuthorURL: www.alleninstitute.org
Tools & Applications:
- Title: HMBA Basal Ganglia resources in Brain Knowledge Platform's Data Catalog
URL: https://knowledge.brain-map.org/data/POZ2HCPBT60DSDJ8UA7
AuthorName: Allen Institute for Brain Science
AuthorURL: www.alleninstitute.org





85 changes: 85 additions & 0 deletions datasets/alliance-genome-resources.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,85 @@
Name: Alliance of Genome Resources
Description: The Alliance of Genome Resources is a consortium that integrates genomic, genetic, and molecular data from leading model organism databases including Drosophila melanogaster, Caenorhabditis elegans, Danio rerio (zebrafish), Mus musculus (mouse), Rattus norvegicus (rat), Saccharomyces cerevisiae (yeast), Xenopus laevis and Xenopus tropicalis (frogs), and human reference data. The Alliance provides comprehensive datasets including gene annotations, disease associations, expression data (bulk and single-cell RNA-Seq), protein and genetic interactions, orthology relationships, variants and alleles, and complete genome sequences with annotations. Data is organized into Alliance-wide integrated datasets and organism-specific collections, supporting comparative genomics, disease modeling, and functional genomics research.
Documentation: https://github.com/alliance-genome/agr_open_data
Contact: help@alliancegenome.org
ManagedBy: Alliance of Genome Resources Consortium
UpdateFrequency: Quarterly releases (every ~3 months)
Tags:
- aws-pds
- genomic
- bioinformatics
- biology
- gene expression
- life sciences
- genetic
- genome
- Drosophila melanogaster
- Caenorhabditis elegans
- Danio rerio
- Mus musculus
- Rattus norvegicus
- Homo sapiens
- transcriptomics
- protein
- vcf
- fasta
License: Most Alliance data is available under CC0 1.0 Universal (Public Domain Dedication). Some datasets may use CC-BY 4.0 (attribution required). Full details at https://www.alliancegenome.org/terms-of-use
Citation: Alliance of Genome Resources Consortium. Alliance of Genome Resources Portal - unified model organism research platform. Nucleic Acids Research (2023). https://doi.org/10.1093/nar/gkac1003
Resources:
- Description: Alliance-wide integrated datasets including disease associations, gene expression, molecular and genetic interactions, orthology relationships, gene descriptions, and variants across all Alliance organisms. Data is organized by release version (8.3.0/, 8.2.0/, etc.), then by data type, with organism-specific collections for FB (FlyBase/Drosophila), MGI (Mouse), RGD (Rat), SGD (Yeast), WB (Worm), XBXL/XBXT (Xenopus), ZFIN (Zebrafish), and HUMAN reference data. Available in TSV, JSON, and VCF formats.
ARN: arn:aws:s3:::alliance-genome-downloads
Region: us-east-1
Type: S3 Bucket
Explore:
- '[Browse Bucket](https://alliance-genome-downloads.s3.amazonaws.com/)'
- Description: FlyBase-specific data for Drosophila melanogaster and related species, including gene annotations, GO annotations, expression data (bulk RNA-Seq, single-cell RNA-Seq), disease associations, phenotypes, interactions, orthologs, genome sequences (FASTA), and genome annotations (GFF3/GTF). Data organized by release (current/, FB2025_04/, etc.) with precomputed analysis files and complete Chado XML database dumps. Publicly accessible via HTTPS for direct download without AWS credentials.
ARN: arn:aws:s3:::s3ftp.flybase.org
Region: us-east-1
Type: S3 Bucket
Explore:
- '[Browse via HTTPS](https://s3ftp.flybase.org/releases/current/)'
DataAtWork:
Tutorials:
- Title: Alliance of Genome Resources AWS Data Access Tutorials
URL: https://github.com/alliance-genome/agr_open_data/blob/main/TUTORIAL.md
AuthorName: Alliance of Genome Resources Consortium
AuthorURL: https://www.alliancegenome.org
Tools & Applications:
- Title: Alliance of Genome Resources Portal
URL: https://www.alliancegenome.org
AuthorName: Alliance of Genome Resources Consortium
AuthorURL: https://www.alliancegenome.org
- Title: FlyBase - Drosophila Database
URL: https://flybase.org
AuthorName: FlyBase Consortium
AuthorURL: https://flybase.org
- Title: WormBase - C. elegans Database
URL: https://www.wormbase.org
AuthorName: WormBase Consortium
AuthorURL: https://www.wormbase.org
- Title: ZFIN - Zebrafish Database
URL: https://zfin.org
AuthorName: ZFIN
AuthorURL: https://zfin.org
- Title: MGI - Mouse Genome Database
URL: http://www.informatics.jax.org
AuthorName: MGI
AuthorURL: http://www.informatics.jax.org
- Title: RGD - Rat Genome Database
URL: https://rgd.mcw.edu
AuthorName: RGD
AuthorURL: https://rgd.mcw.edu
- Title: SGD - Saccharomyces Genome Database
URL: https://www.yeastgenome.org
AuthorName: SGD
AuthorURL: https://www.yeastgenome.org
- Title: Xenbase - Xenopus Database
URL: http://www.xenbase.org
AuthorName: Xenbase
AuthorURL: http://www.xenbase.org
Publications:
- Title: Alliance of Genome Resources Portal - unified model organism research platform
URL: https://doi.org/10.1093/nar/gkac1003
AuthorName: Alliance of Genome Resources Consortium
ADXCategories:
- Healthcare & Life Sciences Data
4 changes: 4 additions & 0 deletions datasets/amazon-last-mile-challenges.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,10 @@ Contact: lastmile-research-challenge@amazon.com
ManagedBy: "[Amazon](https://www.amazon.com/)"
UpdateFrequency: None

Collabs:
ASDI:
Tags:
- infrastructure
Tags:
- transportation
- machine learning
Expand Down
Loading