Ethereum is an open blockchain platform that allows anyone to build and use decentralized applications. In 2013, Vitalik Buterin, the founder of Ethereum, published the Ethereum whitepaper, which outlined the core of its design and technical aspect of Ethereum, laying the foundation for the vigorous development of blockchain applications and ecosystems in the future.
The primary goal of Ethereum was to improve upon the functionality of Bitcoin, not just as a digital payment system. The main difference between the two is that Ethereum can store code that can be executed. This core innovation is also called smart contracts, which means that any developer can write executable decentralized applications (dApps) on the Ethereum blockchain to implement various business needs and social contracts.
With the emergence of smart contracts, users can conduct secure, transparent, and automated transactions and collaborations without the need for trust or intermediaries. This expanded the utility and functionality of blockchain networks.
However, the development and practical application of Ethereum have inevitably encountered some challenges, with the most significant being the scalability issues. When the cryptocurrency market experienced rapid growth and a surge in transaction volume, the performance of the Ethereum blockchain becomes insufficient to meet the demand. Therefore, Ethereum needs continuous innovation and evolution to cater to its growing user base.
This lesson provides a comprehensive overview of Ethereum’s technological evolution, explores how Ethereum has emerged as a pioneer in the blockchain space, and guides you to understand the scalability challenges facing Ethereum and the corresponding solutions.
Before the emergence of Ethereum, the focus of blockchain development was primarily limited to digital currencies and payment systems, using decentralized blockchain ledgers to record transactions. Taking Bitcoin as an example, the Bitcoin network can be seen as a state transition system that uses the Unspent Transactions Output (UTXO) model. UTXO refers to the remaining amount of tokens from a transaction that can be used in another transaction. It can be interpreted as the remaining unspent balance after someone gives you some amount of money.
Each time a miner produces a new block, it represents a change in the state of the Bitcoin network, as the new block contains some valid transactions that consume some existing UTXOs and create new UTXOs. These new UTXOs become the latest state of the Bitcoin network, which reflects the number and distribution of Bitcoin held by all users.

Previous state + valid transactions in a new block = Current state

State transition of UTXO
In everyday life, people are more familiar with the account model used in traditional banking systems, where a blockchain address corresponds to an account. Each transaction involves transferring funds or data from one account to another, and the account balance is the asset you own. When conducting transactions using the UTXO model, there is no restriction on address, meaning that you can use a different address for each transaction. Your asset is the sum of all UTXOs associated with you, rather than the balance of a specific address. Therefore, UTXO has better parallelization, while the account model is more likely to execute sequentially, resulting in limited performances. Although the UTXO model offers better privacy and scalability, it is less intuitive and difficult to implement complex logical operations. For example, the programming language of Bitcoin has the following limitations:
Non-Turing complete
Although the Bitcoin scripting language supports a wide variety of functionalities, it does not support all computations, particularly lacking support for loops. In theory, any loop operation can be achieved by repeatedly executing the underlying code through multiple layers of IF statements, but this will lead to very low operational efficiency.
Difficult to determine the value
UTXO scripts cannot finely control the amount that can be withdrawn. For example, user A needs to hedge the price risk and user B acts as a counterparty. The two parties deposit assets worth $1,000 into a contract. After 30 days, user A gets back $1,000 worth of Bitcoin, and the remaining portion is sent to user B. This would require an oracle to determine the dollar value of 1 Bitcoin, forcing the oracle to hold many UTXOs of different denominations.
Lack of state flexibility
UTXO can either be spent or unspent. It is difficult to create options contracts, decentralized quoting systems, cryptographic commitment protocols, and other applications that require multi-stage states (such as “in-use”). Therefore, UTXOs can only be used to build simple one-time contracts, rather than more complex “stateful” contracts in decentralized protocols.
Unable to see blockchain data
The UTXO model does not have visibility into blockchain data, such as random numbers, timestamps, and the hash of the previous block. This limitation restricts the development of gaming and other types of dApps.
To address the shortcomings of Bitcoin in smart contract applications, Ethereum abandoned the UTXO model and adopted an account model, similar to personal bank accounts, where a database records the balance of each account and automatically updates it with each transaction.
There are two types of accounts on Ethereum: externally owned accounts controlled by private keys and contract accounts controlled by smart contract code. Ether serves as the medium of exchange within the Ethereum network.
In the development of blockchain, Ethereum was the first blockchain network to facilitate the widespread adoption of smart contracts and dApps. Smart contracts on Ethereum are typically programmed using the Solidity language. They are programs executed on the blockchain with features such as automation, decentralization, transparency, and immutability, making them suitable for implementing various business logic and transaction rules. Users can securely and reliably transact without the need to trust third parties.
Smart contracts can be used to develop various types of dApps, making the Ethereum network essentially a globally shared computer. Anyone can access data and edit and operate different types of applications on this computer to meet their needs.
At first glance, it may not seem particularly unique, as it is just running software programs, right? Storing data and executing web application code on centralized servers or installing software on personal computers can seemingly achieve the same application functionalities as smart contracts.
However, this approach comes with limitations and risks such as data loss, tampering, and service disruptions. The emergence of smart contracts provides unprecedented security, verifiability, and censorship resistance to traditional applications. Specifically, Ethereum’s smart contracts have several key differences compared to general web applications and client-side software:
Ethereum Virtual Machine (EVM) is a Turing-complete computer that runs on the Ethereum blockchain. It executes the code of smart contracts and modifies the state of the blockchain. An analogy to understand it is the operating system (OS) running on personal computers or smartphones, where only within this specific environment can smart contracts and Ethereum applications be executed.
The EVM operates in an isolated environment, unaffected by external interference and without impacting the underlying hardware. It works by compiling the code of smart contracts, usually written in the Solidity language, into a bytecode instruction set called EVM bytecode. The EVM then executes these instructions step by step.
The execution of the EVM results in changes to the state of the blockchain, such as transferring tokens, storing data, or triggering events. To prevent malicious smart contract code from causing infinite loops or crashing the Ethereum network, each instruction consumes a certain amount of gas. If there is insufficient gas, the operation cannot be performed.
Gas fee refers to the transaction fee paid in Ether (ETH) on the Ethereum network for executing smart contracts or transferring tokens. It is used to reward miners who maintain the network. The cost of the gas fee depends on the complexity of the transaction and the congestion level of the network, typically measured in Gwei (1 Gwei = 0.000000001 ETH).
Traders can set their own gas price (the price per unit of gas) and gas limit (the maximum amount of gas they are willing to pay) to adjust the speed and cost of their transactions.
The formula for calculating gas fee on Ethereum is as follows:
Gas Price * Gas Limit = Gas Fee
Gas price represents the unit price of the fee. If the gas price is too low, the priority of the transaction may be lower and miners may ignore it.
Gas limit refers to the quantity of fees. If the gas limit is too low, it won’t affect the transaction’s priority, but the transaction may fail due to incomplete execution, and the paid gas fee will not be refunded.
Under EIP-1559 (Ethereum Improvement Proposal), the gas price includes a base fee that is dynamically adjusted based on network demand and block size. Transactions with gas prices set above the base fee are guaranteed to be processed, and the ETH will be burned to reduce the token supply in the Ethereum network.
Tips are optional additional fees that users are willing to pay to miners or validators. Users can set their own values for tips, which can help improve the priority of their transactions or expedite transaction speed during network congestion.
Due to Ethereum’s gas mechanism, the computational resources and storage space of the blockchain network can be appropriately priced, which can prevent smart contracts from executing infinite wrong loops or being attacked by malicious actors.
Prior to the Ethereum Merge in September 2022, both Ethereum and Bitcoin relied on the Proof of Work (PoW) consensus mechanism to facilitate the normal operation of their blockchain networks.
Devices participating in mining were required to solve a complex cryptographic puzzle by using computational power to calculate a hash value that meets specific requirements. This process consumed significant computational resources and electricity, and the difficulty of the puzzle continued to change with the network computing power. Users providing computational power were referred to as miners.
By continuously attempting to calculate the hash value, miners had the opportunity to find a hash value that meets the requirements. Miners could then package the pending transactions into a block and broadcast the block to the entire network for validation and confirmation. If the block was confirmed, the miner would receive block rewards and transaction fees.
After the Ethereum 2.0 upgrade, Ethereum adopted the Proof of Stake (PoS) consensus mechanism for mining. Nodes that validate transactions are required to stake a certain amount of Ether as collateral in order to compete for the right to produce new blocks.
The purpose of staking is to impose economic risks on nodes so that nodes attacking the network will have their staked assets slashed, leading to losses. This serves as a deterrent against malicious behaviors. The more Ether a node stakes, the higher the probability of gaining the right to produce new blocks.
The nodes responsible for producing blocks need to package pending transactions, generate new blocks, and broadcast them to the network for validation and confirmation. If a block is confirmed, the node receives block rewards and transaction fees, as well as certain returns based on the amount of staked assets.
The mining mechanism plays a crucial role in the Ethereum network, primarily reflected in the following aspects:
Ensuring transaction reliability and security
Mining involves adding new transactions to the blockchain, increasing the number of blocks, and confirming the legitimacy of these transactions by obtaining proof of work (currently proof of stake). Mining makes it difficult for attackers to maliciously tamper with transactions, ensuring transaction reliability and security.
Maintaining decentralization of the blockchain
The mining mechanism can prevent any single entity or organization from controlling the entire Ethereum network, as mining requires a significant amount of computing power (currently staked tokens instead) and attackers would need substantial resources to gain control. This ensures the decentralization of the network and prevents monopolistic control by any individual or entity
Providing incentives for participation
Mining allows participants to earn rewards by contributing resources. This incentivizes users to engage in the construction and operation of the network, thereby ensuring its smooth functioning. 
Compared to PoW mining, PoS mining requires significantly less computing resources and power consumption, making it better suited to meet the development and scalability needs of the Ethereum network.
Ethereum’s smart contracts have enabled many important applications, including:
DeFi (Decentralized Finance)
DeFi is one of the most successful applications of Ethereum’s smart contracts. It leverages smart contracts to establish a range of decentralized financial protocols such as MakerDAO, Compound, Uniswap, and more. These protocols enable financial services like lending, trading, insurance, asset management, etc.
NFTs (Non-Fungible Tokens)
NFTs are digital assets based on Ethereum’s smart contracts, providing uniqueness and indivisibility through smart contract functionality. NFTs find applications in various fields such as art, music, gaming, virtual real estate, and more.
DAOs (Decentralized Autonomous Organizations)
A DAO is a decentralized autonomous organization based on Ethereum’s smart contracts. It allows members to make decisions on internal matters through voting, such as budget allocation, personnel management, investment decisions, and more. DAOs operate in a truly decentralized and democratic manner and can be applied to communities, companies, foundations, and other organizations.
Token issuance
The ERC-20 standard on Ethereum specifies the issuance and trading of tokens, enabling the creation of one’s own tokens that can be traded on the Ethereum network.
Decentralized storage
There are also some distributed file storage protocols based on smart contracts on Ethereum. For example, the Swarm protocol allows users to store files in a distributed manner on the network, making them more secure and reliable.
Cloud computing
There are also some decentralized computing platforms based on smart contracts on Ethereum, such as Golem and iExec. These platforms allow users to rent computing resources to run various applications, thus achieving more efficient computing and data processing.
Digital identity
EIP-4361 introduces a standardized framework for Login with Ethereum, allowing users to log in to dApps using their Ethereum addresses instead of traditional usernames and passwords. This authentication mechanism enhances user privacy and security while reducing intermediaries’ involvement. \
In addition to the aforementioned applications, Ethereum smart contracts can enable a wide range of other use cases, such as gaming, social networking, and supply chain management, among others. Due to the programmability of smart contracts, the application scenarios on the Ethereum platform continue to expand, offering significant potential for future development.
Ethereum has sacrificed some measure of performance in pursuit of decentralization and security. Due to technical limitations, Ethereum has a block generation time of about 12 to 15 seconds, with each block having a size limit of 15 MB. As a result, Ethereum can only process about 15 transactions per second.
However, as the largest smart contract network in terms of ecosystem and development community, Ethereum has experienced a significant increase in the number of unique wallet addresses with a non-zero balance on its blockchain, growing more than tenfold in the past five years. The surge in different protocols and user activities has overwhelmed the existing design, exacerbating Ethereum’s scalability challenges.
To improve Ethereum’s scalability, the development community has proposed various solutions, including sidechains, rollups, sharding technology, and diverting traffic to other EVM-compatible blockchain networks to alleviate congestion.
In this lesson, we have delved into the technical architecture and principles of Ethereum, as well as how the innovation of smart contracts harnesses the potential of blockchain, enabling a diverse range of decentralized applications and ecosystems to facilitate various business needs and social contracts.
However, the rapid growth of Ethereum has brought to light the issue of low scalability. Addressing the scalability challenge has hence become a top priority in order to meet the increasing user demands.
In the next lesson, we will explore different scalability solutions and their applications within the ecosystem, including sidechains, rollups, and various competitors to EVM. We will also analyze their respective advantages and disadvantages, providing you with a better understanding of the current development status and future trends of blockchain technology.
| 🎥・Main Video 
 | 📄・Related Articles |