The social network for data people
Brandon Gadoci
data.world - The social network for data people
Mike Coutermarsh
Hey @contextjunkie! I'm not familiar with SPARQL. How is it different from SQL? Do people need to know it to use data.world? Also, I saw dwSQL, whose syntax already looks very familiar, nearly identical to SQL. What are the differences? What's each used for?
Shad Reynolds
@mscccc @contextjunkie Hi Mike, where SQL is the language of choice for tabular and relational data, SPARQL is better suited to pattern matching across linked data (RDF, Semantic Web, etc.). The languages look somewhat similar, but serve distinct purposes. We believe that linked data is an important part of the future of open data. We've put together an awesome SPARQL tutorial for those who want to learn more: https://docs.data.world/tutorial... We created dwSQL to make the powerful querying and joining capabilities of linked data accessible to anyone who knows SQL. Our implementation is quite full-featured. We support the vast majority of SELECT-style queries, including joins, aggregation, sorting, limits, etc. You can learn more here: https://docs.data.world/tutorial...
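To make "pattern matching across linked data" a little more concrete, here is a minimal sketch in plain Python. It is a toy illustration only, not how data.world or any real SPARQL engine is implemented. The triples, the `?variable` convention, and the helper names (`is_var`, `match_pattern`, `query`) are all assumptions made up for this example.

```python
# Toy illustration only: a tiny in-memory set of (subject, predicate, object)
# triples and a naive matcher. All data and names here are hypothetical.

TRIPLES = [
    ("alice", "worksAt", "acme"),
    ("bob", "worksAt", "acme"),
    ("acme", "locatedIn", "austin"),
    ("carol", "worksAt", "globex"),
    ("globex", "locatedIn", "dallas"),
]


def is_var(term):
    """Treat strings starting with '?' as variables, as SPARQL does."""
    return isinstance(term, str) and term.startswith("?")


def match_pattern(pattern, triple, binding):
    """Return an extended copy of `binding` if `pattern` matches `triple`, else None."""
    new_binding = dict(binding)
    for p, t in zip(pattern, triple):
        if is_var(p):
            # A variable must keep the same value everywhere it appears.
            if new_binding.get(p, t) != t:
                return None
            new_binding[p] = t
        elif p != t:
            return None
    return new_binding


def query(patterns, triples=TRIPLES):
    """Match every pattern in turn, carrying variable bindings forward."""
    bindings = [{}]
    for pattern in patterns:
        next_bindings = []
        for b in bindings:
            for triple in triples:
                nb = match_pattern(pattern, triple, b)
                if nb is not None:
                    next_bindings.append(nb)
        bindings = next_bindings
    return bindings


# Roughly the shape of a SPARQL query with two triple patterns:
#   ?person worksAt ?org .  ?org locatedIn austin .
people_in_austin = query([
    ("?person", "worksAt", "?org"),
    ("?org", "locatedIn", "austin"),
])
print(sorted(b["?person"] for b in people_in_austin))  # ['alice', 'bob']
```

The shared variable `?org` is what links the two patterns. In SQL the same question would usually be written as an explicit join between two tables.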
Andrew Ward
Awesome product, this is like a GitHub for data! Curious how you've thought about the opportunity to accelerate learning for the next generation of people just starting to get into the data world.
Carolee Mitchell
@andrewaward This question couldn't be more astute. Our ongoing success will rely heavily on the next generation of data people. To that end, we are active in forming relationships at universities via speaking engagements, hosted class projects, and capstone sponsorships. GitHub provided a great platform for coders to create a portfolio of their work; we are that portfolio and repository for data folks. This is especially important as people start their careers in data. We are also building partnerships with organizations like Data Society and Coding4TX, which focus on an even younger, emerging crowd of data people.
Kevando
Very sharp looking website and video! Have you guys heard of Datazar?
Joe Boutros
@kevando_ thanks for the kind words! There are a lot of other great platforms approaching these types of 'first mile' data problems, Datazar among them. What became super clear to us last year when we started data.world is that the time is right for data science to undergo the same transformation that open source software did with the rise of GitHub.
Brandon Gadoci
@kevando_ Thanks for the kind words on our design! We've spent a lot of time thinking about it both inside and outside the app.
Dan Driscoll
Great team, great product, could be a game-changer for how disparate but complementary data sets are discovered and integrated
Ian Greenleigh
@dbdriscoll Thanks, that means a lot.
Brett Hurt
@dbdriscoll Thanks much, Dan! We are just getting started and appreciate all of your support.
Ian Greenleigh
FB LIVE in 30! @contextjunkie will take us on a wild ride through our top 5 features, and answer your questions. I'm trying to get him to wear a black turtleneck but no promises. See you soon! https://www.facebook.com/datadot...
Ian Greenleigh
Golden Kitty Awards dataset! https://data.world/producthunt/2...
Arlo Gilbert
Really impressive tools to solve real problems. I was fortunate enough to try it during beta and it just keeps getting better.
Brett Hurt
@arlogilbert Thanks so much, Arlo - we appreciate your kind words. We are launching a lot of functionality each and every week, so it will keep getting better and better quickly!
O O
Very cool product @contextjunkie !!! 🇱🇧?😁
Joe Boutros
@sandrojazzar thanks!!! pretty close, 🇪🇬 😁
foo
This is exciting: now that it's easier to build neural networks thanks to TensorFlow, finding a big enough amount of data is the real pain point for a small company implementing AI. Do you have, or intend to add, any mechanism that could help add labels to data?
Joe Boutros
Hey @oelmekki, excellent question! The rise of ML and deep learning frameworks like TensorFlow is super exciting. One of the cool things about linked data is how robustly datasets can be joined and extended to do things like add labels. Definitely stay tuned for more on that topic! data.world is also a collaborative place where the definition of "dataset" goes beyond just the data itself. We love to see people sharing their own analysis, questions, projects, and even labels along with data, and using dataset discussions to compare notes. If lots of folks do this for the same dataset, the dataset becomes an even more valuable resource for everyone. I've spoken to data scientists who were frustrated by the fact that labels aren't as widely shared as unlabeled data, so I'm excited to see that start to happen!
Brett Hurt
@oelmekki Thanks Olivier, there is no doubt that data.world will eventually become the foundation for ML/AI projects - it has been a part of our vision since the beginning. We hope you decide to become a part of the largest public works project in history on data - that is how we are going to bridge to the "Star Trek" future.
Ryan Kennedy
Amazing idea guys. The execution so far is spot on and your team is world class. More APIs!
Brandon Gadoci
@ryankennedy Thanks so much Ryan!
Joe Boutros
@ryankennedy Thanks!! I'll be sure to let you know when we're ready!
Shad Reynolds
@ryankennedy Thanks! More APIs coming soon ;)
Shad Reynolds
@ryankennedy Just put together a blog post on the current state of our APIs: https://meta.data.world/apis-at-...
Joe Boutros
Hey Hunters! Joe from data.world here. As a special treat for you, we've compiled the most comprehensive PH dataset ever released (https://data.world/producthunt/p...). 2 years of posts, votes, taglines, and more! Dig in - we can't wait to see what insights you find!

And a bit about the platform... We're all about helping data people solve problems faster, so we've built a collaboration platform to address a glaring, urgent need. With hundreds of killer visualization and analysis tools out there, why are we stuck in the stone age when it comes to the most frustrating and time-consuming parts of any data project: finding, understanding, preparing, and sharing data? data.world tackles this issue by helping you discover, explore, contribute, and share/publish: better, faster, easier, and all in one place.

Discover: Browse thousands of open datasets contributed by organizations and data people from all over the world.
Explore: See the data's "story" alongside the data itself. Preview the data before you dive in. Query within and across datasets, and create exploratory visualizations with just a few clicks.
Contribute: Join the discussion with an international community of data people. Post hunches, share analysis techniques and insights, and find new collaborators.
Share / Publish: Upload from your computer or pull down from the cloud. Automatically enhance your data, and make it instantly queryable and joinable to other datasets. Showcase your work and build out your data work portfolio.

Please don't hesitate to share any questions or feedback right here. We'll be online 😎 Thanks, Hunters, and welcome to the social network for data people! -Joe Boutros https://data.world/jboutros
Patrick Adiaheno
This is a great idea with an A+ team! I did research on oil and gas recovery at UT-Austin. There is a ton of data collected that no one can access. Sometimes it feels like you are almost starting from scratch. Having a unified source of data like data.world could really help the petroleum industry and others. We should all support this startup!
Ian Greenleigh
@adiaman2000 Thank you, Patrick. It sounds like you do interesting work, and I'm glad the platform is useful to you.
Yousif Aldujaili
This couldn't be more useful for people like me who are just getting started with bringing data and data-driven decisions into an organisation. Thank you!
Matt Laessig
Thanks for the love, @seffa121! We definitely think the world will benefit from being able to more easily find, understand, and collaborate on the world's data, which is currently very fragmented and siloed, and which fails to capture the context and improvements that others have already contributed.
Venkat Janapareddy
Data.world has built amazing technology on open data that is very easy to use compared to its competitors. I have used it personally, and I also asked the team at a popular machine learning platform to play with data.world. They were very impressed with what data.world has done so far and how easy it is to integrate. The number of datasets is growing day by day, and it's very easy to integrate with private data.
Ian Greenleigh
@venkatjanapared That's what we like to hear! Thanks
Brandon Gadoci
@venkatjanapared Thanks for the kind words Venkat!
JT Singh
Kaggle also is a social network for the data community...how does this differ from Kaggle?
Joe Boutros
Thanks for the question, @jt_singh! At data.world our community is 100% focused around collaboration and the efficiencies that can be unlocked when people and data start working together. Our aim is to get data people of all stripes working together - from journalists and designers to data scientists and researchers. Kaggle was a true pioneer in the area of machine learning competitions, so you can see the major difference in approach - collaboration vs. competition, appeal to all data people vs. those with a specific machine learning focus.
Ken Kaczmarek
Love the concept! Data collaboration is still in the dark ages (email, SFTP, CSV, etc.). Do you have (or are you planning to have) an API to grab datasets? We're working on a web-based data prep/ETL tool and love the idea of helping others connect to and use open data (or pipe into data.world).
Shad Reynolds
@wanderslth We are absolutely working to make our data available externally. We will likely have a number of different APIs in the coming months. We've already open sourced a JDBC driver (https://github.com/datadotworld/...) which allows users to connect and run both SPARQL and SQL queries against their datasets. If you have thoughts on what this should look like, please reach out directly to help@data.world. Thanks :)
Shad Reynolds
@wanderslth I just put together a blog post about the current state of our APIs: https://meta.data.world/apis-at-...
Ken Kaczmarek
@shadr Great to see; thanks! We're launching our own initial public API next week, so this is quite relevant. Wishing you guys much success in tackling this market!
Alex Weber
Guys, you are awesome! This is just what I have been looking for!
Gabriela Swider
My favorite data.world feature is the New Knowledge visualizations; I love how I can easily play around with the sample queries and look at the data in a variety of graph formats, then copy the embed code to paste the viz I created in my dataset summary or discussion threads. Here's a quick gif showing how to copy/paste your viz to spice up your dataset: https://data.world/gswider/embed...
Ian Greenleigh
@gabriela_swider That's definitely in the running for my top feature, too.
Selene
Join us at 2pm CT for a live demo with Joe Boutros, Director of Product Engineering! :) https://www.facebook.com/datadot...
Jonathan Ortiz
The post-truth world in which we seem to have found ourselves is pretty scary. So I'm happy we're doing what we're doing to enable evidence-based solutions to the world's problems.