Pasky’s Log

YodaQAâ€™s abilities are enlarged by traffic domain

May 23rd, 2016 1 comment

Guest post by Petr Marek (source)

Everybody driving a car needs the navigation to get to the destination fast and avoid traffic jam. One of the biggest problems is how to enter fast the destination and how to find where are the congestions, what is the traffic situation. YodaQA Traffic is a project attempting to answer the traffic related questions quickly and efficiently. Drivers may ask questions in natural language like: â€œWhat is the traffic situation in the EvropskÃ¡ street?â€ or â€œWhat is the fastest route from Opletalova street to Kafkova street?â€ You can try out the prototype (demo available only for limited time) – try to ask for example â€œtraffic situation in the Wilsonova streetâ€ .

YodaQA Traffic still has some limitations. Currently we only have a browser version not suitable for smart phones. It is answering traffic questions for Pragueâ€™s streets only.

But as usual, this whole technology demo is open source – you can find it in the branch f/traffic-flow of our Hub project.

How does it work and where we get the data from?

All YodaQA are first analyzed to recognize and select traffic questions. We do it in two steps. The first step is to recognize the question topic. We use six topics like traffic situation, traffic incident or fastest route. The topic is determined by comparing semantic similarity of the userâ€™s question with a set of reference questions. We estimate the similarity with our Dataset-STS Scoring API. Each reference question is labeled by a â€œtopicâ€. The Sentence Pair Similarity algorithm selects the reference question â€œtopicâ€ with the highest similarity to the question.

Next we need to recognize the location, i.e. to recognize the street name. This is handled by another tool called the Label-lookup which we normally use for entity linking in YodaQA. It compares questions words with a list of all street names in the Prague. We exported the list of streets names in Prague from OpenStreetMap. We do not do exact match, we try to select the closest street name from the list.

The last step is to decide whether the question is really the traffic question, because the Dataset-STS API and Label-lookup can find topic and street name even in a pure movie question like â€œWhen was the Nightmare on Elm Street released?â€. The Dataset-STS and Label-lookup return not only topic or street name but also the score, fortunately. We created dataset of over 70 traffic questions and over 300 movies questions and founded the minimal score thresholds, with which the recognition makes the lowest classification error on this dataset.

Once we know the type of question and the location we start a small script accessing the traffic situation data from HERE Maps. The only complication is that the the API doesnâ€™t return traffic situation for particular street, but bounding box only. To overcome this problem we have to find a bounding box for a desired location, using an algorithm we developed for this purpose. Then we call the traffic flow API to acquire the information for all streets in the bounding box. Finally, we filter out the traffic situation for the desired street.

It was great fun to work on this application, it is not perfect but it shows how to create intelligent assistants helping people solving various everyday situations. We are also excited to see, how the users will use the new functionality of YodaQA and how it will help them.

Categories: ailao, software Tags: 3c, dataset-sts, geo, guest, maps, nlp, sps, yodaqa

GPS souÅ™adnice ÄeskÃ½ch mÄ›st a obcÃ

February 1st, 2014 14 comments

Pro zobrazovÃ¡nÃ poloh dopadÅ¯ meteosond na IRC jsem potÅ™eboval v jednoduchÃ©m CSV formÃ¡tu seznam souÅ™adnic ÄeskÃ½ch mÄ›st, ale ukÃ¡zalo se, Å¾e je pÅ™ekvapivÄ› obtÃÅ¾nÃ© nÄ›co takovÃ©ho zÃskat. Sice existuje tabulka na jednom astronomickÃ©m webu, vÃ½bÄ›r tam zahrnutÃ½ch obcÃ je ale docela divnÃ½, nÄ›kde je mÃsto obce jen jejÃ ÄÃ¡st, atd.

Nakonec jsem zvolil postup “udÄ›lej si sÃ¡m”, a to kombinacÃ seznamu na Wikipedii, Google Geocoding API a trochy XPath.

Seznam rozumnÃ© podmnoÅ¾iny mÄ›st mohu zÃskat tÅ™eba pomocÃ:

curl 'http://cs.wikipedia.org/w/index.php?title=Seznam_obc%C3%AD_s_roz%C5%A1%C3%AD%C5%99enou_p%C5%AFsobnost%C3%AD&action=edit' |
  sed -ne 's/^# \[\[\([^]|]*|\)*\([^]]*\)\]\].*/\2/p' | sort

MÃ¡m-li zase jmÃ©no obce, jejÃ souÅ™adnice mohu zÃskat tÃmto zaklÃnadlem:

m=AÅ¡; curl -s 'http://maps.googleapis.com/maps/api/geocode/xml?address='"${m// /+},+CZ"'&sensor=false' |
  xmllint --xpath '//location[lat or lng]//text()' -

(DÅ¯leÅ¾itÃ½ trik je to ,CZ, jinak bude Google znÃ¡t spoustu KolÃnÅ¯ a AÅ¡ bude znamenat AmerickÃ¡ Samoa. AlternativnÄ› si mÅ¯Å¾ete z vÃ½sledkÅ¯ vyfiltrovat ty ÄeskÃ© pomocÃ XPath //result[address_component/short_name/text()="CZ"]/geometry/location[lat or lng]//text().)

curl 'http://cs.wikipedia.org/w/index.php?title=Seznam_obc%C3%AD_s_roz%C5%A1%C3%AD%C5%99enou_p%C5%AFsobnost%C3%AD&action=edit' |
  sed -ne 's/^# \[\[\([^]|]*|\)*\([^]]*\)\]\].*/\2/p' | sort |
  while read m; do
    echo -n $m
    curl -s 'http://maps.googleapis.com/maps/api/geocode/xml?address='"${m// /+},+CZ"'&sensor=false' |
      xmllint --xpath '//location[lat or lng]//text()' - |
      tr -s '\n' ' ' | tr ' ' ','
    echo
    sleep 0.1
  done | sed 's/,$//'

Categories: linux Tags: bash, curl, geo, google, sonde, wiki

Weathersonde – Nearby Landing Notification

January 26th, 2014 No comments

At our hackerspace brmlab, one of the things we do is picking up landed weather sondes. In short, fun hardware literally falling off the sky, several times a day, every day. These are stratospheric balloons used for weather data prediction, launched from various sites, that reach the 35km altitude, then the balloon bursts and it lands back on the ground at a random location. At the whole time, it transmits its current GPS coordinates via radio, making this a rather exciting sub-class of geocaching.

As a simple hack today (idea by chido), I created a simple script sonde.sh that is designed to be run three times a day, runs sonde trajectory prediction (a predict.habhub.org service – example) and if the sonde is predicted to land in a certain radius, reports that with a link to the prediction. By default, it is connected to jendabot, one of our brmlab IRC robotic minions, written in an appealingly crazy way as a collection of bash scripts.

Categories: life, software Tags: bash, curl, geo, geocaching, irc, sonde, wget

Pasky’s Log

Archive

YodaQAâ€™s abilities are enlarged by traffic domain

How does it work and where we get the data from?

GPS souÅ™adnice ÄeskÃ½ch mÄ›st a obcÃ

Weathersonde – Nearby Landing Notification

Recent Comments

Categories

Blogroll

Licence

Pasky’s Log

Archive

YodaQAâ€™s abilities are enlarged by traffic domain

How does it work and where we get the data from?

GPS souÅ™adnice ÄeskÃ½ch mÄ›st a obcÃ­

Weathersonde – Nearby Landing Notification

Recent Comments

Tags

Categories

Blogroll

Licence

GPS souÅ™adnice ÄeskÃ½ch mÄ›st a obcÃ