Disease Monitor

Each infectious disease has a spreading pattern. It is spread under certain conditions, using certain environmental factors and sometimes it infects only specific hosts (animals, humans). Finding out such a pattern can sometimes prove difficult, moreover following the distribution is challenging. A tool must be develop in order to facilitate scientists and medical staff to follow how a certain disease has spread and how it relates to other conditions and environmental factors (weather, location etc.).

  • Requirements: data
  • Programming Skill Level: Intermediate to Advanced;
  • Designer Skill Level: Intermediate to Advanced;
  • Other Skills: ;
  • Similar projects:

Color Perception and Data Vizualization

Develop an application which will be able to generate Ishihara Tests based on data from a database. The data will be loaded in the application which will determine if such a visualization is possible from the current data set. If not the data will be correlated/linked with other data sets in order to generate the visualization.

  • Requirements: visualization library
  • Programming Skill Level: Intermediate to Advanced;
  • Designer Skill Level: Intermediate to Advanced;
  • Other Skills: ;
  • Similar projects:

Info graphics HTML5 Editor

So far many organization had published (and still do) a lot of (open) data. While this is a good practice, it is still hard to make sense of all this data. A quick easy method to make the data available to the public in a easy manner is to present it in an inforgraphic. Therefore we require a web application that takes advantage of predefined widgets with which a user can design his own infographic. The widgets will be dynamic and can incorporate data from a json file.

This idea is similar to one I proposed a while back Visualize your data .

Tag Cloud & Text Classification

Considering we have so many texts, books, news, papers & scientific articles in many magazines and articles across the internet it is hard to classify them based on the content we have in them. Thus we require an application which, based on a set of given articles is able to display a tag cloud of the most popular words (consider multiple forms of the words used e.g. “lexemes” ; also consider setting a minimum number of words) from the specified set. Consider different visualization methods of the cloud, sphere shaped, ordered by most popular, different orientations of the words. Also display relationships between words that appear in the same sentence, the same paragraph, or appear near one another multiple times. It will also be possible to view the sentence that word initially belonged to.

Bonus: a side by side generation/comparison of tag clouds/texts.

  • Requirements: ;
  • Programming Skill Level: Beginner to Intermediate ;
  • Designer Skill Level: Intermediate to Advanced;
  • Other Skills: ;
  • Similar projects:

Semantic Organic Medicine

An organic drug store needs a way to organize its medicinal plant collection and sort it depending on the disease/medical condition it could be recommended for. A disease/medical condition has certain symptoms (some diseases/medical conditions have similar symptoms) and medicinal plants treat certain diseases/medical conditions (sometimes the medicinal plants only treat a limited number of symptoms). Thus the drug store needs a quick way of searching through its database and finding the best organic plant for a certain disease/medical condition.

Notice that certain symptoms might be more important than others depending on how the patient describes them.

Bonus: use semantic web technologies.

  • Requirements: ;
  • Programming Skill Level: Intermediate to Advanced;
  • Designer Skill Level: Beginner to Intermediate;
  • Other Skills: ;
  • Similar projects:

Digital Legacy Box

Our grandparents and parents stored all their pictures and memories in boxes (or house attics), with the intent of leaving a reminder for their children or grandchildren. But as the new generation comes most of our pictures and memories aren’t stored in physical boxes like the ones our grandparents gave us, but on computers (or on the “cloud”). Also considering worldwide the number of digital pictures increases annually (further reading ), what are we going to leave our children/relatives? (lots and lots of GB for them to make sense of)

The idea is to design and Develop a Digital Legacy Box (DLB) application where people can leave reminders for their relatives after they pass away. The application will encompass travels, important life changes etc. by linking different accounts of a certain person (facebook, twitter, flickr, linkedin, travel apps, etc.) and display them to the relatives after that person passed away.

Bonus: How do we distinguish between a person passing away and one who stopped using social networks ?

Cafeteria Management

Considering the large number of students/workers and meals a cafeteria has to handle, they are looking to improve the way they handle their clients, payments and meals.

They require an application which could help them with:

  • getting the number of students/workers that had lunch in a certain day;
  • finding out what was the most popular dish (main course and/or desert) that day;
  • automatize payments using NFC (Near Field Communication) and/or e-Payment (Prepayd card, e-money etc.);
  • displaying the menu online for the current month (RSS feed/or website);
  • receive requests regarding certain dishes;
  • personalize dishes for lactose intolerant, lacto vegetarian, vegans, meat lovers, fish lovers etc.;
  • optimize dishes to be affordable and healthy;
  • personal nutritionist regarding what a student/worker ate that day/week/month (available only for those that pay with NFC or e-Payment ).


  • Requirements: ;
  • Programming Skill Level: Intermediate to Advanced;
  • Designer Skill Level: Beginner to Intermediate;
  • Other Skills: Food enthusiast :) ;
  • Similar projects:

Semantic web resources – part 2

As a follow up to the previous Semantic web resources – part 1 post, this post is focused on listing some of the existing Ontologies and SPARQL Endpoints

(sort of a) Beginner’s Guide:


Examples of SPARQL Endopoints:


Tools & Other Resources:



Open data Sources

*This post is constantly updating so check back at any time. Last update: 05.November.2013*

Open Data Handbook – http://opendatahandbook.org/

School of Data Handbook – http://schoolofdata.org/handbook/

Linked Data Patterns Book – http://patterns.dataincubator.org/book/

What is Open Data and how we might define it, you can find out more at Open Definition and Guide to Open Data Licensing made by Open Knowledge Foundation.

A great way to search for Open Data sources is: http://datacatalogs.org/dataset (doesn’t have all the sources, some of the ones I mentioned below are missing).

http://openrefine.org/ – tool for working with messy data, cleaning it up, transforming it from one format into another

Another way to search for open public data is http://www.google.com/publicdata/directory .

A great open source data portal software is: http://ckan.aorg/ . (used in many open data portals)

An interesting website (partly blog) about data and data visualization – http://www.datapointed.net/

Open data policy and Definitions:


Various Types of Open Data:


Ocean Open Data:


Search Engines/Crawl Open Data:


Maps Open Data:


Weather Maps Open Data:


Library Open Data:


Newspapers Open Data:


Music/Movie Industry Open Data:


University Open Data:


Sensor Open Data:


Health Open Data Sources:


Financial Open Data Sources:


Government Open Data Sources:

Semantic web resources – part 1

Here are a few resources where you can find more about the semantic web. I tried to divide them according to certain categories. Also feel free to add your own in the comments.

This is just part 1 – I will follow up with additional resources such as books, articles and other related resources in future posts.

Semantic web resources part 2

(sort of a) Beginner’s Guide:


W3C Specifications (you might want to check out):




Microdata & RDFa:




(useful) Libraries: