Posts

Excel Regular Expression Parsing

Sample piece of code to parse Excel files with regular expressions using Excel Query (The first piece does not loop through all the rows to make it more efficient and fetches the resultant set based on the first row. The second piece goes through all the columns and finds a match): //This piece uses ExcelQuery --> using System; using System.Collections.Generic; using System.Linq; using System.Web; using System.IO; using System.Text; using System.Web.UI; using System.Text.RegularExpressions; using System.Web.UI.WebControls; using Scanning; using System.Data; using Innovative.Data; //Install ExcelQuery namespace WebApplication1 {     public partial class WebForm1 : System.Web.UI. Page     {          protected void Page_Load( object sender, EventArgs e)         {         ...

Elasticsearch Notes

Image
Been recently playing with a lot of open source tool sets to figure out core solutions for different product ideas that I have. One of the recent technologies I have used is the Elasticsearch tool. Elasticsearch is basically a NoSql based indexing solution that allows one to use Lucene indexes on top of massive data sets especially string based documents. This blog post is just a bunch of notes that I have compiled. What is Elasticsearch? Elasticsearch is a document store with each document stored as an index in a cluster with multiple shards. Sharding is basically a concept of partitioning data based on some metric within the data: Now Elasticsearch exposes an http based request-response to query the individual documents stored in the index. In my case I created a 2 node cluster as shown in the following image: After this step I created an index called imdb_search . Initially wanted to create a Graphing tool to showcase the connections that I had in facebook and the relatio...