Directory Structure #2

emilbayes · 2015-09-25T11:56:09Z

Let's discuss how we should organise the repo now that we've decided that using the wiki feature wasn't ideal.

I imagine a folder structure along these lines?

.
├── LICENSE.md
├── README.md
├── government
│   ├── README.md
│   ├── bbr.md
│   ├── dawa.md
│   ├── dst.md
│   ├── ft.md
│   ├── kortforsyningen.md
│   ├── miljoeportal.md
│   ├── uddannelsesstatistik.md
│   ├── uni-c.md
│   └── virk.md
├── museums-and-archives
│   ├── README.md
│   ├── kb.md
│   ├── natmus.md
│   ├── politietsregisterblade.md
│   ├── ppt-museum.md
│   └── smk.md
└── private-companies
    ├── README.md
    ├── dr.md
    └── postdanmark.md

However there are some unsolved problems with this structure. Should each dataset have its own file, or should they be grouped by organisation? I've based file names on FQDN, but then organisations like AU which have many diverse datasets might become hard to get an overview of. Or maybe we should try and group by categories like Infrastructure, History, Environment, Media etc. ?

The text was updated successfully, but these errors were encountered:

AndreasMadsen · 2015-09-25T16:27:58Z

Should each dataset have its own file, or should they be grouped by organisation

I think it would be best to add a file for each resource in general. But there are nice cases like dawa.aws.dk where a single file for dawa is enough. In that case it shouldn't get its own directory.

I've based file names on FQDN

I like that. Maybe we should add .dk but it might be redundant in this case. But I definitely think dawa should be dawa.aws.md.

but then organisations like AU which have many diverse datasets might become hard to get an overview of.

I don't think they have that many datasets them self. Its is more a documentation on what are the social issues and what organizations provides datasets that can describe them. So lets revisit that if it becomes an issue.

Or maybe we should try and group by categories like Infrastructure, History, Environment, Media etc. ?

If the purpose is to have an internal documentation of the raw APIs and then later provide module/API on top of that, then it makes sense to do the (organization-type, organization, resource).

If the purpose is to have an overview of other dataminers, then the data types makes more sense. However doing this means we can't organize the files using FQDN.

I prefer the organization-type, as it makes it easier to edit the file (involves a finding a file I already know about). Browsing the files using data-types can then be done by creating an external index.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Directory Structure #2

Directory Structure #2

emilbayes commented Sep 25, 2015

AndreasMadsen commented Sep 25, 2015

Directory Structure #2

Directory Structure #2

Comments

emilbayes commented Sep 25, 2015

AndreasMadsen commented Sep 25, 2015