Home
Books
Data Processing and Analysis Techniques
1
Step-by-step guide to building your own LLM.
Learn coding techniques for large language models.
Pretrain and fine-tune for specific tasks.
Understand attention mechanisms and architecture.
Develop a chatbot that follows instructions.
Includes free eBook with print purchase.
Requires intermediate Python and machine learning knowledge.
Average Rating out of 5:
Price Range: $47.37
2
Illuminates how computers and software truly function.
Engaging storytelling builds understanding layer by layer.
Revised edition includes new chapters and graphics.
Explores the heart of smart devices: CPU.
Companion website enhances learning with animated graphics.
Teaches computing concepts through relatable analogies.
Unveils the digital revolution's fundamental essence.
Average Rating out of 5:
Price Range: $28.19
3
Comprehensive guide to data-intensive application design.
Explore scalability, consistency, and reliability challenges.
Examine pros and cons of various technologies.
Understand trade-offs in architecture decisions.
Learn from major online service architectures.
Navigate tools like NoSQL and relational databases.
Make informed decisions for modern applications.
Average Rating out of 5:
Price Range: $26.33
4
201 real interview questions from top tech companies
Step-by-step solutions for better understanding
Focus on essential Data Science interview topics
Tips for resume crafting and networking
Case study practice from renowned companies
Insights from Ex-Facebook data professionals
Avoid counterfeit books; buy new only
Average Rating out of 5:
Price Range: $42.75
5
Comprehensive guide for data manipulation in Python.
Updated for Python 3.10 and pandas 1.4.
Practical case studies for effective data analysis.
Learn pandas, NumPy, and Jupyter essentials.
Ideal for Python programmers and data analysts.
Access data files and materials on GitHub.
Create informative visualizations with matplotlib.
Average Rating out of 5:
Price Range: $43.99
6
Comprehensive overview of data engineering landscape.
Learn to plan and build robust data systems.
Evaluate technologies through the data engineering lifecycle.
Understand data generation, ingestion, and orchestration concepts.
Design architecture using best practices and frameworks.
Incorporate governance and security in data processes.
Cut through marketing hype for better technology choices.
Average Rating out of 5:
Price Range: $43.99
7
Comprehensive overview of data engineering practices.
Learn to build robust data systems.
Evaluate technologies through a data lifecycle framework.
Understand key concepts: ingestion, transformation, storage.
Assess data problems with best practices.
Incorporate governance and security in engineering.
Available PDF included with your purchase.
Average Rating out of 5:
Price Range: $21.43
8
Step-by-step guide to building LLMs.
Clear explanations with diagrams and examples.
Code a GPT-style language model from scratch.
Fine-tune LLMs for specific tasks and data.
Prepare datasets suitable for effective training.
Develop your own personal assistant LLM.
Requires intermediate Python and machine learning knowledge.
Average Rating out of 5:
Price Range: $43.99
9
Explores common statistical fallacies and paradoxes.
Improves decision-making using data and statistics.
Accessible introduction to statistical thinking for everyone.
Real-world examples from health and politics included.
Covers correct and incorrect interpretations of data.
Engaging style appeals to curious readers.
Highlights importance of statistics in modern challenges.
Average Rating out of 5:
Price Range: $3.99
10
Comprehensive guide for practical statistics in data science.
Learn essential statistical concepts using R and Python.
Bridges gap between statistics and data science practices.
Exploratory data analysis as a key preliminary step.
Random sampling techniques to improve dataset quality.
Principles of experimental design for definitive answers.
Statistical machine learning methods to learn from data.
Average Rating out of 5:
Price Range: $45.25
11
Unlock insights from your organization’s data.
Intuitive access to data for informed decisions.
Step-by-step guide by expert Jack Hyman.
Learn data modeling and visualization techniques.
Create compelling reports for decisive action.
Master advanced functions like DAX and integrations.
Transform data into actionable business strategies.
Average Rating out of 5:
Price Range: $22.47
12
Practical guide for creating Power BI dashboards.
25 chapters covering various chart types.
Includes 40 visuals from AppSource gallery.
Step-by-step instructions for visuals setup.
Tips for effective data preparation techniques.
Quizzes to reinforce learning material.
Suitable for both analysts and nontechnical users.
Average Rating out of 5:
Price Range: $41.33
13
Learn R programming through practical projects.
Write your own functions and simulations.
Master data manipulation and analysis techniques.
Use R programming tools effectively.
Enhance skills for real-world data science tasks.
Develop fast, vectorized R code.
Utilize R’s package system and debugging tools.
Average Rating out of 5:
Price Range: $29.89
14
Master Power Query M for data transformation.
Learn fundamentals and advanced concepts effectively.
Hands-on examples for real-world application.
Optimize performance and handle errors efficiently.
Practical strategies for data processing techniques.
Perfect for analysts and business intelligence users.
Automate data cleaning processes to save time.
Average Rating out of 5:
Price Range: $41.37
15
Engaging illustrations simplify complex statistical concepts.
Covers essential topics like probability and hypothesis testing.
Humorous approach makes learning enjoyable and memorable.
Updated edition with new material included.
Perfect for beginners seeking statistical literacy.
Learn through relatable and entertaining examples.
Ideal for students and curious minds alike.
Average Rating out of 5:
Price Range: $2.99
16
Comprehensive guide to database internals.
Explores modern distributed data systems.
Covers storage engines and their classifications.
Discusses efficient storage building blocks.
Explains complex distributed system communication patterns.
Examines consistency models in database clusters.
Includes resources from open-source databases.
Average Rating out of 5:
Price Range: $36.49
17
Hands-on learning with popular ML frameworks.
Covers Scikit-Learn, Keras, and TensorFlow.
Intuitive understanding of machine learning concepts.
Numerous code examples and practical exercises.
Explore neural network architectures and techniques.
Suitable for programmers with basic experience.
Updated third edition with recent breakthroughs.
Average Rating out of 5:
Price Range: $51.29
18
Comprehensive overview of data engineering landscape.
Plan and build robust data systems effectively.
Evaluate technologies through the data engineering lifecycle.
Learn data generation, ingestion, and orchestration techniques.
Incorporate data governance and security measures.
Apply best practices for data architecture design.
Understand critical concepts for any data environment.
Average Rating out of 5:
Price Range: $41.79
19
Immersive learning experience for data storytelling.
Build confidence in creating impactful visualizations.
Over 100 hands-on exercises to practice skills.
Learn from real-world examples and detailed illustrations.
Practical guidance for applying lessons at work.
Develop skills to inspire and influence action.
Master data storytelling for exceptional communication.
Average Rating out of 5:
Price Range: $26.07
20
Master data analysis in Microsoft Excel.
Learn VLOOKUPS, Pivot Tables, and more.
Unlock career potential with analytical skills.
Practical exercises for real-world applications.
Develop critical analytical mindsets and techniques.
Accessible language for all skill levels.
Enhance decision-making in today’s digital landscape.
Average Rating out of 5:
Price Range: $18.99
21
Hands-on recipes for effective data analysis.
Master pandas 2.x and its advanced features.
Streamline workflows with practical, ready-to-use solutions.
Learn data wrangling, visualization, and optimization.
Integrate pandas effectively with NumPy and databases.
Boost efficiency in large dataset handling.
Ideal for Python developers and data professionals.
Average Rating out of 5:
Price Range: $35.61
22
Step-by-step guide to building LLMs.
Understand LLM design and creation process.
Learn dataset preparation for LLM training.
Fine-tune models for specific tasks effectively.
Use human feedback for instruction compliance.
Build your own chatbot from scratch.
Suitable for intermediate Python and ML learners.
Average Rating out of 5:
Price Range: $17.46
23
Comprehensive guide to database internals.
Explores modern distributed data systems.
Examines storage engine classifications and types.
Analyzes efficient storage organization techniques.
Discusses node communication patterns in distributed systems.
Explores consistency models in database clusters.
Includes PDF with Audible purchase.
Average Rating out of 5:
Price Range: $18.80
24
Comprehensive guide for Power BI beginners. Learn data modeling and visualization techniques. Practical examples of Power BI features included. Free eBook with print or Kindle purchase. Transition from Excel to Power BI easily. Master data-driven storytelling with enhanced visualization.
Average Rating out of 5:
Price Range: $26.80
25
Actionable insights for effective data governance.
Align governance with measurable business outcomes.
Expert guidance from a seasoned chief data officer.
Real-world case studies demonstrating successful implementation.
Practical strategies for executive buy-in and support.
Comprehensive overview from ideation to delivery.
Enhance confidence in data management practices.
Average Rating out of 5:
Price Range: $42.49
26
Learn SQL using MySQL in one day. Hands-on project for practical experience. Concise format for busy individuals. Step-by-step guidance for beginners. Complete process from creation to retrieval. Master SQL through examples and practice.
Average Rating out of 5:
Price Range: $3.99
27
Understand modern data architectures and their evolution.
Learn strengths and weaknesses of each approach.
Differentiate between data warehouse and data lake.
Explore data lakehouse and its benefits.
Clarify data mesh concepts and realities.
Gain insights for better data architecture solutions.
Essential guide for data professionals and architects.
Average Rating out of 5:
Price Range: $50.99
28
Comprehensive guide to transformer models in NLP.
Hands-on approach with practical coding examples.
Learn with Hugging Face Transformers library.
Optimize transformer models for real-world applications.
Techniques for efficient model deployment included.
Covers cross-lingual transfer learning methods.
Updated full-color edition for better clarity.
Average Rating out of 5:
Price Range: $41.60
29
Learn data analysis using Python libraries.
Master Pandas for data manipulation and analysis.
Create stunning visualizations with Seaborn.
Build predictive models using Scikit-learn.
Work with real-world data in practical analyses.
Stay ahead in the growing data analyst field.
Comprehensive training and reference for data science.
Average Rating out of 5:
Price Range: $49.82
30
Concise introduction to data science fundamentals.
Explains evolution and current uses of data science.
Covers data infrastructure and integration challenges.
Introduces basics and applications of machine learning.
Discusses ethical and legal issues in data science.
Offers principles for successful data science projects.
Explores future impact of data science developments.
Average Rating out of 5:
Price Range: $10.83