BigSemantics:
taming the wild web

BigSemantics is an open source software architecture for developing powerful new interaction paradigms and other applications based on supporting metadata semantics from diverse information sources. BigSemantics includes the meta-metadata language (including inheritance mechanisms and code generators), a repository of meta-metadata wrappers, the BigSemantics service, and a powerful runtime library for clients and servers.

Wrappers are authored in the meta-metadata language to support a life cycle of metadata semantics for curation, information-based ideation, and other tasks. This metadata life cycle includes acquisition, presentation, manipulation (by humans and programs), storage, and transmission. Meta-metadata wrappers specify: (1) data models for the corresponding metadata, (2) rules for extracting metadata from particular information sources, (3) operations on metadata, and (4) presentation guidelines, including semantic zooming.

We define an information source as a type of document, differentiated from others by template-driven structure, published by a web site or digital library, and functionally accessed by URL pattern, MIME type, or other selector. These meta-metadata wrappers serve as an abstraction layer between the information sources and users, supporting information curation and interaction applications, such as IdeaMÂCHÉ, Metadata In-Context Expander (MICE), and TweetBubble, for desktop, pen+touch, and mobile devices.

BigSemantics comes with a large yet growing repository of wrappers, supporting a wide range of information sources, including weather forecasts, Google Books, Amazon Products, games, movies, blog posts, RSS, and so on. You are welcomed to contribute to the wrapper repository! See "getting involved" for more information.

Check out the BigSemantics wiki at github for further introductions, scenarios, and instructions on getting started. Set up a development environment, and perform the tutorials to get your feet wet.

publications
Qu, Y. Supporting Ideation by Integrating Exploratory Search, Browsing, and Curation , Proc. ACM Conference on Human Information Interaction and Retrieval (CHIIR) 2016, 361-363. http://dx.doi.org/10.1145/2854946.2854948
Jain, A., Lupfer, N., Qu, Y., Linder, R., Kerne, A., Smith, S. M., Evaluating TweetBubble with Ideation Metrics of Exploratory Browsing, Proc. ACM Creativity and Cognition 2015. [28%]. Best Paper Nomination. http://dx.doi.org/10.1145/2757226.2757239 [video]
Qu, Y., Kerne, A., Lupfer, N., Linder, R., Jain, A., Metadata Type System: Integrate Presentation, Data Models and Extraction to Enable Exploratory Browsing Interfaces, Proc. ACM Engineering Interactive Computing Systems (EICS) 2014, 107-116 [18%]. http://dx.doi.org/10.1145/2607023.2607030 [video]
Kerne, A., Qu, Y., Webb, A.M., Damaraju, S., Lupfer, N., Mathur, A. Meta-Metadata: A Metadata Semantics Language for Collection Representation Applications, Proc ACM Conference on Information and Knowledge Management, 1129-1138. [12.7%] http://dx.doi.org/10.1145/1871437.1871580
Kerne, A., Damaraju, S., Kumar, B., and Webb, A. Meta-Metadata: A Semantic Architecture for Multimedia Metadata Definition, Extraction, and Presentation, Poster and Demo Proceedings of the 3rd International Conference on Semantic and Digital Media Technologies.