Big data refers to the dynamic, large and disparate volumes of data being formed by people, tools and machines it requires new, original and scalable equipment to collect, host and analytically process the vast amount of data gathered in order to derive real-time business insights that relate to customers, risk, profit, performance, productivity management and enhanced shareholder value. A good understanding of Hadoop Architecture is required to leverage the power of Hadoop. This list primarily includes questions related to Hadoop Architecture, Hadoop and Hadoop Distributed File System (HDFS). Clusters of objects are formed so that objects within a cluster have high similarity in comparison to one a further, but are very dissimilar to objects in other clusters. Clustering is commonly used to search for unique grouping within a data set. AIDS disease is a major health problem and it is the leading causes of death during the world. Early detection of AIDS disease has become an important issue in the medical research fields.