Data Science for Beginners in Marathi Language.
Part 1 - Python platform installation with Anaconda.org & complete install tutorial on Jupyter Notebook for Python Programming in Marathi Language. Complete download and installation of jupyter notebook for using python.
Marathi Data Science Playlist - • Marathi Data Science Complete
Download Link for Anaconda Installer - https://www.anaconda.com/products/dis...
thanks
Following are few details about Technology & few Modern Terms FYR.
Data science : using scientific methods, algorithms, and systems to extract knowledge and insights from data.
Decision science : for business problems, data science combined with
behavioural science and design thinking to understand end users.
Business intelligence (BI): analyzing and reporting historical data, like sales statistics and operational metrics, to guide strategic decision-making.
Data analysis: inspecting, cleansing, transforming, and modeling data, with the goal of discovering useful information.
Data mining: discovering patterns in data with methods and tools like machine learning, statistics, and database systems.
Exploratory data analysis (EDA): summarizing a dataset’s main characteristics and informing the development of more complex models or logical next steps.
Data engineering: building infrastructure with which data are gathered, cleaned, stored, and prepped for data science.
DataOps: automated, process-oriented methodologies to improve quality and reduce cycle time in data analytics — akin to DevOps for data, with these key differences.
Artificial Intelligence (AI): computer systems that can perform tasks that normally require human intelligence, using human reasoning as a model.
AIOps: DataOps at the intersection of AI and big data, often using machine learning with the intent to feed continuous insights into continuous improvement, and often including collaborative automation, performance monitoring, and event correlations.
Machine learning (ML): A subset of AI in which a system learns from input by identifying patterns in that data, then applies those patterns to new problems or requests, allowing data scientists to teach a computer to carry out tasks rather than programming it step-by-step.
Supervised learning: a subset of ML with a data scientist guiding or teaching the desired conclusion to the algorithm, such as a system learning to identify problems by being trained on a dataset of correctly labeled and characterized problems.
Deep learning: advanced machine learning systems with multiple input/output layers, as opposed to shallow systems having one round of data input/output.
MLOps : akin to DevOps or DataOps, collaboration and communication between data scientists and operations professionals to manage the production ML lifecycle, with increased automation and improved quality per business and regulatory requirements.
ETL (extract, transform, load): data integration from multiple sources, normalized or transformed into a common or standardized format, often to build a data warehouse.
Data Visualization (dataviz): visual representation of text-based information, to help recognize patterns, trends, and correlations and to generally understand the significance of data.
Data Model: defines how datasets are connected to each other and how they are processed and stored.
Data Warehouse: repository where all the data collected by an organization is stored and used as a guide for business decisions.
R: programming language for statistical computing, used by statisticians and data miners for data analysis and developing statistical software.
Python: programming language popular for manipulating and storing data, as well as for general-purpose programming.
SQL (Structured Query Language): declarative programming language used to perform tasks such as updating or retrieving data.
Big data: data sets too large or complex to be dealt with by traditional data-processing software.
Classification: an example of supervised learning in which an algorithm puts new data under a pre-existing category based on characteristics for which the category is already known — for example, classification can be used to determine if a customer is likely to spend over $20 online, based similarity to other customers who have previously spent that amount.
Cluster analysis: like classification, but where the algorithm receives inputted data and finds similarities in the data itself by grouping data points together that are alike, i.e. classification without supervised learning.
Hypothesis testing: use of statistics to determine the probability that a given hypothesis is true; often used in science.
Statistical power: the probability of making the correct decision to reject the null hypothesis when the null hypothesis is false, i.e. higher statistical power reflects lower likelihood of concluding incorrectly that a variable has no effect.
Thanks.
Информация по комментариям в разработке