Sick: Indexed deduplicated binary storage for JSON-like data structures

SICK: Streams of Independent Constant Keys SICK is an approach to handle JSON-like structures and various libraries implementing it. SICK allows you to achieve the following: Store JSON-like data in efficient indexed binary form Avoid reading and parsing whole JSON files and access only the data you need just in time Store multiple JSON-like structures in one deduplicating storage Implement perfect streaming parsers for JSON-like data Efficiently stream updates for JSON-like data The tradeoff for these benefits is somehow more complicated and less efficient encoder. The problem JSON has a Type-2 grammar and requires a pushdown automaton to parse it. So, it’s not possible to implement efficient streaming parser for JSON. Just imagine a huge hierarchy of nested JSON objects: you won’t…

Read more on Hacker News