LIS Blog
OASIS (Online Acquisitions and Selection Information System) is ProQuest's free web-based system for searching, selecting, and ordering print and electronic books for academic libraries.
OASIS users have access to the broadest title database in the industry, along with powerful search tools that help locate titles quickly and efficiently. Through OASIS, users can reach a range of leading ebook platforms: aggregator platforms such as Ebook Central®, R2 Digital Library and EBSCO®*, and a growing list of publisher platforms that includes UPSO, Wiley Online Library, Info-Sci, Taylor & Francis, World Scientific, De Gruyter, and Karger. Print book distribution centers in the US and UK and a print partnership with INGRAM ensure efficient print order fulfillment.
Lucene vs Solr
Many people new to Lucene and Solr will ask the obvious question: Should I use Lucene or Solr?
The answer is simple: if you're asking yourself this question, in 99% of situations, what you want to use is Solr.
A simple way to conceptualize the relationship between Solr and Lucene is that of a car and its engine. You can't drive an engine, but you can drive a car. Similarly, Lucene is a programming library that you can't use as-is; you have to write code around it. Solr is a complete application that you can use out of the box.
What is Solr?
Apache Solr is a web application built around Lucene with all kinds of goodies.
It adds functionality such as:
- XML/HTTP and JSON APIs
- Hit highlighting
- Faceted Search and Filtering
- Geospatial Search
- Fast Incremental Updates and Index Replication
- Caching
- Web administration interface
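Unlike Lucene, Solr is a server application. Releases before Solr 5.0 were packaged as a web application (WAR) that could be deployed in any servlet container, e.g. Jetty, Tomcat, or Resin. From Solr 5.0 onward, Solr runs as a standalone server with Jetty bundled, and deploying the WAR into another container is no longer supported.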
Solr can be installed and used by non-programmers. Lucene cannot.
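To make the HTTP/JSON API point concrete, here is a minimal sketch in Python that builds a request URL for Solr's `/select` handler, asking for JSON output, one facet field, and hit highlighting. The host `http://localhost:8983`, the core name `books`, and the field names `title` and `author` are assumptions for illustration only; the sketch builds the URL and does not contact a server.

```python
from urllib.parse import urlencode


def build_solr_query(base_url, core, text, facet_field, highlight_field, rows=10):
    """Return a Solr /select URL requesting JSON results, facets, and highlighting."""
    params = [
        ("q", text),                    # the query string
        ("wt", "json"),                 # response format
        ("rows", rows),                 # number of hits to return
        ("facet", "true"),              # turn on faceting
        ("facet.field", facet_field),   # field to count facet values for
        ("hl", "true"),                 # turn on hit highlighting
        ("hl.fl", highlight_field),     # field to highlight matches in
    ]
    return f"{base_url}/solr/{core}/select?{urlencode(params)}"


# Hypothetical host, core, and field names.
url = build_solr_query("http://localhost:8983", "books", "title:lucene", "author", "title")
print(url)
```

Opening a URL like this in a browser or fetching it with any HTTP client is all a caller needs to do, which is the practical difference from Lucene, where the same search means writing Java code against the library.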
PARLIAMENT DIGITAL LIBRARY (PDL)
The Parliament Digital Library provides access to parliamentary documents of the Lok Sabha, the House of the People. The debates of the Lok Sabha, from the First to the Sixteenth Lok Sabha, are available on the portal. Reports of several Parliamentary Committees, Presidential Addresses to Parliament, Budget Speeches, and several publications of the Lok Sabha Secretariat are also part of its collections. The historic legislative debates from 1858 to 1952, which trace the growth and development of modern parliamentary institutions in India over a period of 95 years, are being uploaded periodically. The portal gives users a single-window retrieval interface with filtered and federated search across multiple searchable resources.
A data lake is a centralized repository that allows you to store all your structured and unstructured data at any scale. You can store your data as-is, without having to first structure the data, and run different types of analytics—from dashboards and visualizations to big data processing, real-time analytics, and machine learning to guide better decisions.
Why do you need a data lake?
Organizations that successfully generate business value from their data will outperform their peers. An Aberdeen survey found that organizations that implemented a data lake outperformed similar companies by 9% in organic revenue growth. These leaders were able to run new types of analytics, such as machine learning, over new sources stored in the data lake: log files, click-stream data, social media, and internet-connected devices. This helped them identify and act on opportunities for business growth faster by attracting and retaining customers, boosting productivity, proactively maintaining devices, and making informed decisions.
| Characteristics | Data Warehouse | Data Lake |
| --- | --- | --- |
| Data | Relational from transactional systems, operational databases, and line of business applications | Non-relational and relational from IoT devices, web sites, mobile apps, social media, and corporate applications |
| Schema | Designed prior to the DW implementation (schema-on-write) | Written at the time of analysis (schema-on-read) |
| Price/Performance | Fastest query results using higher cost storage | Query results getting faster using low-cost storage |
| Data Quality | Highly curated data that serves as the central version of the truth | Any data that may or may not be curated (i.e. raw data) |
| Users | Business analysts | Data scientists, Data developers, and Business analysts (using curated data) |
| Analytics | Batch reporting, BI and visualizations | Machine Learning, Predictive analytics, data discovery and profiling |
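The schema-on-write versus schema-on-read distinction can be sketched in a few lines of Python. This is only an illustration under stated assumptions: an in-memory SQLite table stands in for the warehouse, a list of JSON lines stands in for raw files in a lake, and the sample records and the names `sales`, `customer`, `amount`, and `device` are made up.

```python
import json
import sqlite3

# Raw events as they might land in a data lake: one JSON document per line,
# with no fixed set of fields (hypothetical sample data).
raw_lines = [
    '{"customer": "a", "amount": 12.5}',
    '{"customer": "b", "amount": 7.0, "device": "mobile"}',
    '{"customer": "a", "clicks": 3}',
]

# Schema-on-write: the table structure is fixed before any data is loaded,
# and records that do not fit the schema are rejected at load time.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (customer TEXT NOT NULL, amount REAL NOT NULL)")
loaded, rejected = 0, 0
for line in raw_lines:
    record = json.loads(line)
    try:
        conn.execute(
            "INSERT INTO sales (customer, amount) VALUES (?, ?)",
            (record.get("customer"), record.get("amount")),
        )
        loaded += 1
    except sqlite3.IntegrityError:
        rejected += 1  # the record without an amount violates NOT NULL
warehouse_total = conn.execute("SELECT SUM(amount) FROM sales").fetchone()[0]


# Schema-on-read: the raw lines are kept as-is, and each analysis decides
# which fields it needs when it reads them.
def read_amounts(lines):
    for line in lines:
        record = json.loads(line)
        if "amount" in record:
            yield record["customer"], float(record["amount"])


lake_total = sum(amount for _, amount in read_amounts(raw_lines))

# A later analysis can ask a different question of the same raw data,
# including fields the warehouse schema never had.
devices = [r["device"] for r in map(json.loads, raw_lines) if "device" in r]

print(loaded, rejected, warehouse_total, lake_total, devices)
```

Both paths give the same sales total, but only the raw lines still hold the `device` and `clicks` fields. The trade-off matches the table: the warehouse enforces quality up front, while the lake defers structure to the moment of analysis.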